Identifying, recording and reproducing information

ABSTRACT

A method and apparatus of recording video and/or audio material on a recording medium, the method including an identifier for identifying a piece of video and/or audio material recorded on the recording medium, an activity indicator representative of relative activity within a content of the video and/or audio material, and metadata in accordance with the activity indicator.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a Divisional Application of U.S. application Ser.No. 10/016,828, filed on Dec. 4, 2001, and is a continuation of andclaims the benefit of priority to International Application No.PCT/GB01/01458, filed on Mar. 30, 2001 and from the prior British PatentApplication Nos. 0008436.8 filed on Apr. 5, 2000, 0008426.9 filed onApr. 5, 2000, and 0008398.0 filed on Apr. 5, 2000. The entire contentsof each of these documents are incorporated herein by reference.

FIELD OF THE INVENTION

A first aspect of the present invention relates to identifying videoand/or audio and/or data material.

A second aspect of the present invention relates to digital video taperecording.

A third aspect of the present invention relates to recording apparatusand methods of recording audio and/or video information signals. Moreparticularly the third aspect relates to recording apparatus and methodsof recording audio and/or video information signals onto a linearrecording medium. Correspondingly, the third aspect also relates toreproducing apparatus and methods of reproducing audio and/or videoinformation signals and in particular information signals recorded ontoa linear recording medium.

Material, which may be any one or more of audio, video and data, isrepresented by information signals which are preferably digital signals.

First Aspect of the Invention.

Description of the Prior Art

It has been proposed to identify video and/or audio material and/or datausing UMIDs which are identifiers which universally uniquely identifymaterial. UMIDs in principle could identify video material to theaccuracy of one frame. There is a basic UMID and an extended UMID. Abasic UMID has 32 bytes each of 8 bits and an extended UMID has 64bytes. It may be possible in some circumstances to use a reduced datastructure UMID where for instance some data of a plurality of UMIDs iscommon to the plurality of UMIDs.

It is clearly desirable to associate the identifiers as closely aspossible with the material which they identify and most preferablyinclude the identifiers in the material or in the case of materialrecorded on a recording medium record the identifiers on the medium.However, there is little or no spare data capacity in some media,especially tape. In addition it is desirable to record other data suchas Good Shot Markers (GSMs) on the medium with the material. Thus otherdesirable data competes for space on the media.

Summary of the First Aspect of the Invention.

According to the present invention, there is provided a video, audioand/or data signal processing system comprising a recorder for recordingvideo and/or audio and/or data material on a recording medium therecorder including a first generator for generating first materialidentifiers for identifying respective pieces of material on the mediumsuch that each piece is differentiated from other pieces on the medium,and a second generator for generating second, universally unique,identifiers for pieces of material, second identifiers being generatedin respect of one or more of the first identifiers.

A second identifier may be generated for each of the first identifiers.A second identifier may be generated in respect of a group of two ormore first identifiers.

The first identifiers, which need to distinguish the pieces of materialon the medium, but need not be universally unique, can thus be smallerthan universally unique identifiers. For example the first identifiersmay comprise only two bytes. That is sufficient to allow the secondgenerator to generate the second identifiers. Also, it minimises theamount of data which need be stored on the medium to identify materialthereon allowing other data, e.g. Good Shot Markers, to be recorded.

In preferred embodiments, a medium identifier is provided whichidentifies the medium. It is for example a serial number. The secondgenerator generates the second identifiers in dependence on the mediumidentifier and the first identifiers of the material on the medium.

In another embodiment, the medium is housed in a housing supporting adata store. The data store preferably stores the medium identifier andmay also store at least one of the first identifiers. Most preferablythe first identifiers are recorded on the medium and the store storesonly the last produced of the first identifiers to enable the firstgenerator to produce the first identifiers in a regulated manner.

The use of the first identifiers or of the first identifiers plus mediumidentifiers which may be placed on the medium and/or in the data storeallows existing record medium formats especially tape formats to use theidentifiers and be incorporated in a production and distribution systemwhich uses universally unique identifiers such as UMIDs and metadatabases. Existing tape formats can accommodate the first identifiers andtape cassettes having data stores are in current use (at the applicationdate of this application).

The embodiments of the invention address the problem of labelling tapesand other recording media by providing the medium identifier. The use ofshort first identifiers allows GSMs to be recorded. The data store isnot essential but can be used if available.

These and other aspects and advantages of the first aspect of theinvention are set out in the following description of FIGS. 1 to 30.

Second Aspect of the Invention

Description of the Prior Art.

Several formats of digital video tape have been proposed. The firstcommercially successful format was the so-called “D1” format, describedin the book, “Introduction to the 4:2:2 Digital Video Tape Recorder”,Gregory, Pentech Press, 1988. Since then there have been many otherformats, either standardised or proprietary.

A feature that these formats have in common is the use of helicalscanning. This is a well-established technique in which the tape mediumis wrapped at least part of the way around a head drum. One or morerotating read/write heads, mounted on the head drum, sweep outsuccessive slant tracks on the tape medium as the medium is progressedslowly past the head drum. Slant tracks may carry a timecode known insome systems as Vertical Interval Timecode (VITC). Linear tracks mayalso be used to carry information such as Linear Timecode (LTC), othercontrol information, a cueing audio track and the like.

Each slant track is generally divided up into a number of regions orsectors. Although the precise number and layout of these regions variesfrom format to format, there are generally one or more video sectors andone or more audio sectors on each slant track. These can storecompressed or uncompressed video and audio data. In other systems, datarepresenting each video frame or image, or a group of images, may berecorded onto a group of tracks.

Recently, interest has developed in ways of recording so-called metadataalong with the audio and video material. Metadata is additional oraccompanying data defining the audio/video material in some fashion, andcan include data items such as material identifying codes (e.g. theSMPTE Unique Material Identifier or UMID), bibliographic data such ascast or staff lists, copyright information, equipment used and so on. Ofcourse, if any such codes are to be stored alongside the audio/videomaterial on tape, some data capacity needs to be allocated for itsstorage.

One previously proposed solution is to store “small” metadata items suchas material identifiers using the “user bits”, that is a small amount ofuser-definable data within the LTC areas of the tape. Typically the userbits provide only of the order of 4 bytes (32 bits) per frame, of whichsome capacity is taken up by existing schemes such as “good shotmarkers” (GSMs). As an SMPTE UMID occupies at least 32 bytes, and insome forms up to 64 bytes, this solution provides for only a limitedstorage of this data.

Summary of the Second Aspect of the Invention

This second aspect of the invention provides a digital video taperecorder operable to record video and/or audio material on successiveslant tracks, storing a slant track video timecode having a plurality ofuser-definable data bits, and at least one linear track, storing alinear track timecode having a plurality of user-definable data bits, ona tape medium;

the digital video tape recorder being operable to store a materialidentifying code in the user-definable bits of the slant track videotimecode and in the user-definable bits of the linear track timecode.

The invention recognises that previous attempts to store metadata alongwith the audio/video material on tape have suffered from a problemduring “jog” or very slow motion replay of the material. In thesecircumstances, the LTC is unreadable because insufficient head/taperelative speed is obtained.

However, the invention recognises that a solution is not necessarily tobe found simply by switching to use of the user bits in the VITC (orother slant track timecode) because these tend not to be readable duringhigh speed replay such as shuttle operations.

Instead, embodiments of the invention provide for the materialidentifying code to be stored effectively twice—in the LTC and in theVITC. This provides for a reliable replay of the material identifyingcode across a range of replay speeds.

Further aspects and features of the second aspect of the invention aredescribed with reference to FIGS. 28, 29 and 31.

Third Aspect of the Invention

Description of the Prior Art

It is known to store audio and video information in a form in which theinformation may be reproduced when required. Typically this is effectedby recording signals representative of the information onto suitablerecording media in order to provide a reproducible record of the audioand video information signals. For example the conventional consumervideo recorder is provided with an arrangement in which a rotatingmagnetic head records audio and video information signals in tracks on amagnetic tape. Similarly a hand held video camera or “camcorder” isprovided with an arrangement for converting an image formed within afield of view of the camera into video signals which are recorded onto amagnetic tape along with accompanying audio signals. For professionalvideo cameras the preferred medium for recording a reproducible recordof the audio and video signals generated by the camera is to record thevideo signals onto a magnetic tape in some way. The foregoing examplesall use magnetic tape to record information, however as will beappreciated magnetic tape is only one example of a linear recordingmedium in which information signals are recorded onto the medium in someway as the recording medium is driven past a recording head at a ratedetermined by the band width of the information signal being recordedand the capacity of the recording medium to represent these informationsignals.

There is a wide variety and a great number of audio and/or videoproductions which are generated by television companies, by domesticusers as well as professional production companies. Sports programs,musical and opera productions, documentaries and light entertainmentcomedy programmes are but examples of this wide variety of audio and/orvideo productions. The term audio and/or video will be referred toherein as audio/video and includes any form of information representingsound or visual images or a combination of sound and visual images.

As a result of the rich variety and great number of audio/videoproductions in existence, the task of managing and navigating through anarchive of audio/video productions in order to locate particularfeatures or content items of audio/video material within an archive isconsiderably time consuming. Similarly editing an audio/video productiontypically represents a labour intensive task because the audio/videomaterial must be visually scanned by the operator to locate a particularfeature of interest. As a result it has been proposed in our co-pendingUK patent application number GB 9921235.9 to provide an apparatus and amethod for navigating through audio/video information signals usingmetadata.

The term metadata as used herein refers to and includes any form ofinformation or data which serves to describe either the content ofaudio/video material or parameters present or used to generate theaudio/video material or any other information associated with theaudio/video material. Metadata may be, for example, “semantic metadata”which provides contextual/descriptive information about the actualcontent of the audio/video material. Examples of semantic metadata arethe start of periods of dialogue, changes in a scene, introduction ofnew faces or face positions within a scene or any other items associatedwith the source content of the audio/video material. The metadata mayalso be syntactic metadata which is determined and associated with itemsof equipment or parameters which were used whilst generating theaudio/video material such as for example an amount of zoom applied to acamera lens, an aperture and shutter speed setting of the lens, and atime and date when the audio/video material was generated. Althoughmetadata may be recorded with the audio/video material with which it isassociated, either on separate parts of a recording medium or on commonparts of a recording medium, metadata in the sense used herein isintended for use in navigating and identifying features and essence ofthe content of the audio/video material, and may, therefore be separatedfrom the audio/video signals when the audio/video signals arereproduced. The metadata is therefore separable from the audio/videosignals.

Summary of the Third Aspect of the Invention

According to the third aspect of the present invention there is provideda recording apparatus which is arranged in operation to record audioand/or video information signals onto a linear recording medium, theapparatus comprising a recording drive arranged in operation to recordthe information signals onto the linear recording medium, and to recordmetadata associated with the information signals onto the linearrecording media with the information signals, wherein the metadata isrecorded repeatedly.

An improvement is provided to a recording apparatus which is arranged torecord the metadata repeatedly with the audio/video information signals.This is because recording the same metadata repeatedly increases aprobability that the metadata will be recovered correctly when theinformation signals are read from the linear recording medium despiteany loss of information. A loss of information may occur as a result oferrors produced in recording/reproducing the audio/video information andmetadata. A loss of information may also occur when, for example, thelinear recording medium is “shuttled” in which the linear recordingmedium is moved past the recording heads at a rate which is greater thanthe rate at which the information signals were recorded.

The metadata may comprise a plurality of objects and the recordingapparatus may comprise a control processor coupled to the recordingdrive which is arranged in operation to determine a relative importanceof the information represented by the metadata objects and configure therecording drive to record the metadata objects a number of timescorresponding to the relative importance of the metadata objects.

As will be appreciated because there is a considerable variation in thetype and content of metadata, different types of metadata will vary invalue and therefore importance to the audio/video signals with which themetadata is associated. For example, a metadata type of high importanceis the Unique Material Identifier (UMID) whereas metadata of lowimportance is for example the F-stop or aperture setting of the camerawhich was used whilst the audio/video information signals were beinggenerated. By recording the metadata repeatedly in accordance with theimportance of the metadata, a greater immunity to the effect of errorsand to any loss of information which may occur when the informationsignals and metadata are being reproduced from the linear recordingmedium is unequally provided, with the metadata of greater importancebeing provided with greater protection.

Dividing the metadata into metadata objects which define a type andtherefore relative value of the metadata, provides an estimate of therelative importance of the metadata. This is effected by the controlprocessor which also configures the recording drive to record themetadata objects a number of times determined in accordance with therelative importance of the metadata which the objects represent.

The recording drive may be arranged in operation to record theinformation signals onto the linear recording medium at a recordingrate, and the number of times the metadata objects are repeated may bedetermined by the control processor from a combination of the relativeimportance and a reading rate at which the recorded information signalsmay be read from the linear recording medium.

When recording information signals on to a linear recording medium suchas a magnetic tape, the information signals are arranged to be recordedon to the magnetic tape by driving the tape at a recording speed past arecording head which is excited by the information signals. The rate atwhich the tape is driven is determined in accordance with the bandwidthof the information signals, and the capacity of the recording medium torepresent these signals. The rate at which the tape is driven whenrecording the information signals will be known herein as the recordingrate. When the information signals are reproduced from the recordingmedium, the recording medium is again driven passed read heads whichread the information signals from the magnetic tape. The rate of drivingthe recording medium passed the reading head is known herein as thereading rate and in order to recover all the information signals, thereading rate should be the same as the recording rate. However, in amode known as “shuttle” mode, the reading rate may be increased so thatalthough not all the information signals may be recovered from therecording medium an amount is recovered which is sufficient to provide arepresentation of the information signals for fast searching. Thus, aneffect of shuttle mode is to reduce an amount of information which isrecovered from the linear recording medium. Therefore, by repeating thesame metadata object in accordance with the relative importance of themetadata object, the more important metadata objects are more likely tobe correctly recovered.

Furthermore the control processor may operate to determine the number oftimes the metadata objects are repeated from the relative importance anda number of times the reading rate is greater than the recording rate.The number of times the reading rate is greater than the recording rateprovides an integer indication of the number of times the metadataobject must be repeated in order for the metadata object to be recoveredduring shuttle mode.

In a preferred embodiment the recording drive may operate to record theinformation signals and the metadata on to the linear recording mediumwhereby the information signals and the metadata may be separated whenread from the linear recording medium.

The control processor may be arranged in operation to assign each of themetadata objects to one of a plurality of categories of relativeimportance and to allocate each of the metadata objects to one of thecategories of relative importance the number of times the metadataobjects are repeat recorded being predetermined for each of thecategories.

An improvement is provided by arranging the metadata objects into aplurality of categories and repeat recording the metadata objects inaccordance with the category. This provides a reproducing apparatusarranged in operation to reproduce the information signals with animplicit reference to the number of times the same metadata object hasbeen recorded and therefore in dependence upon the reproducing rate thereproducing apparatus may determine whether the same metadata object hasbeen read more than once. In this way, a probability of successfullyrecovering a metadata object may be increased by allocating the metadataobject to a category which has a greater number of predeterminedrepetitions when recorded. Thus by assessing the relative importance,the more important information may be allocated to a category of higherimportance and thereby this information is more likely to be recoveredwhen the metadata and information signals are reproduced.

In a preferred embodiment, for each of the categories of relativeimportance the control processor may be arranged in operation to recordthe same allocated metadata object in each of a plurality of adjacentcells of the recording medium for the predetermined number of times andrecord a subsequent metadata object allocated to the same category forthe predetermined number of times in a corresponding plurality ofadjacent cells. Furthermore the same metadata object may be recordedwith reference to a temporal marker recorded with the informationsignals and the metadata objects. In this way when the reproducingapparatus is recovering the metadata objects, a plurality of cells maybe read from the linear recording medium and with reference to temporalmarker the reproducing apparatus may determine whether the same metadataobject has been recovered.

As an alternative arrangement for repeat recording metadata objects ontothe linear recording medium the control processor may be arranged inoperation to form metadata packets having a plurality of fields, and tocontrol the recording drive to record the metadata packets on to thelinear recording medium, whereby the metadata objects are repeated thepredetermined number of times.

Arranging the metadata objects into packets provides a predefined unitwhich may contain different metadata objects or the same metadataobjects which can be recorded in a controlled way so that recovery ofthe metadata objects may be facilitated.

The control processor may allocate the metadata objects to the fields ofthe packets, whereby the metadata object is repeated in the fields of atleast one of the packets.

Furthermore the control processor may be arranged in operation toallocate a different metadata object to at least two of the fields ofthe packet and record repeatedly the metadata packet the predeterminednumber of times. By providing the packet with different metadata objectsand recording the metadata packet the determined number of times for therelative importance of the metadata objects, the relative protectionprovided by the repeat recording is effected for each of the differentfields within the metadata packet.

The control processor may be arranged in operation to provide each ofthe metadata packets with a header field and to allocate headerinformation to the header field which is indicative of the metadataobjects within the fields of the packet.

Providing the metadata packets with a header which contains informationwhich represents the content of the fields of the packets, facilitatesidentification of the metadata packets and recovery of the metadataobjects from packets which have been repeatedly recorded.

The control processor may be arranged in operation to change the headerinformation between successive packets recorded repeatedly onto thelinear recording medium which have different metadata objects.

Changing the header information between successive packets which havedifferent metadata objects provides a simple and convenient way ofrecognising where metadata packets which are repeatedly recorded changefrom one group to another. By detecting the change in the headerinformation a reproducing apparatus may determine whether the metadatapackets recovered from the recording medium contain more than onemetadata packet which is the same. The reproducing apparatus maytherefore discard redundant metadata packets which are those which arerecovered after a first metadata packet of any one type has beenrecovered.

In a preferred embodiment the linear recording medium is a magnetic tapeand the recording drive has a rotating head which is configured inoperation to record the information signals in helical scan tracksdisposed at an angle to a linear axis of the recording tape and a linearrecording head which is configured in operation to record the metadataalong the linear tracks of the magnetic tape at a position adjacent tothe helical scan tracks. In preferred embodiments the linear tracks areallocated to the user specified bits and the time code allocated to theinformation signals.

According to the third aspect of the present invention there is alsoprovided a reproducing apparatus which is arranged in operation torecover audio and/or video information signals recorded with metadataassociated with the information signals onto a linear recording mediumusing a recording apparatus as herein before described, the reproducingapparatus comprising a reading drive which operates to recover theinformation signals from the linear recording medium and the metadatafrom the linear recording medium and a read control processor which isarranged in operation to determine whether the same metadata has beenread by the reading drive from the linear recording medium, to discardthe metadata which has been read more than once and to reproduce theinformation signals with the metadata.

According to the third aspect of the present invention there is alsoprovided a method of recording audio and/or video information signalsonto a linear recording medium comprising the steps of recording theinformation signals onto the linear recording medium, recording metadataassociated with the information signals onto the linear recording mediumwith the information signals, wherein the metadata is recordedrepeatedly.

BRIEF DESCRIPTION OF DRAWINGS

Further aspects and features of the present invention are defined and inthe following description of FIGS. 32 to 41.

For a better understanding of the present invention, reference will nowbe made, by way of example, to the accompanying drawings in which:

FIGS. 1 to 11 illustrate illustrative versions of the first aspect ofthe present invention;

FIGS. 12 and 13 illustrate UMIDs;

FIG. 14 illustrates a data structure for metadata in a data base;

FIGS. 15 to 21 illustrate an A-Box;

FIGS. 22 to 27 illustrate a B-Box;

FIGS. 28 to 30 illustrate Tape IDs in linear time code;

FIG. 31 illustrates an example of the second aspect of the invention;

FIG. 32 is a schematic block diagram of a video camera;

FIG. 33 is a schematic block diagram of part of the video camera shownin FIG. 1 which includes a recording apparatus,

FIG. 34A is a schematic block plan view of recording heads arranged torecord information onto the magnetic tape within the recording apparatusshown in FIG. 2, and

FIG. 34B is an elevation view of the plan view shown in FIG. 3A,

FIG. 35 is a schematic representation of a position and layout ofinformation recorded onto a magnetic recording tape shown in FIG. 3,

FIG. 36A is a representation of the magnetic recording tape shown inFIG. 4 read at twice the recording speed,

FIG. 36B is a representation of the magnetic tape shown in FIG. 4 readat four times the recording speed, and

FIG. 36C is a representation of the magnetic tape shown in FIG. 4 readat eight times the recording speed,

FIG. 37 is an illustration of a process in which metadata objects areformed into a metadata packet,

FIG. 38 is a representation of an arrangement of metadata objectsrecorded onto the magnetic recording tape,

FIG. 39 is a representation of a further arrangement of metadata objectsrecorded onto the magnetic recording tape,

FIG. 40 is a flow diagram representing a method for producing thearrangement of metadata objects represented in FIG. 38, and

FIG. 41 is a flow diagram representing a method for producing thearrangement of metadata objects represented in FIG. 39.

OVERVIEW-FIRST ASPECT OF THE INVENTION FIGS. 1 to 11

-   -   -   -   -   The following description refers to:

    -   UMIDs which are described with reference to FIGS. 12 and 13;

    -   Metadata which is described with reference to FIG. 14;

    -   A Box which is described with reference to FIGS. 15 to 21;

    -   B Box which is described with reference to—FIGS. 22 to 27; and

    -   Tape IDs which are described with reference to FIGS. 28 to 30.

Referring to FIG. 1, a camcorder 500 is equipped with an A-box 152. Thecamcorder 500 records video and audio material on a recording mediumwhich may be a tape 126 or a disc for example. The following descriptionrefers to tape for convenience and because tape is currently the mostcommon recording medium for camcorders. The tape is housed in a cassettewhich supports a data store 502 additional to the tape. The store 502may be a Telefile (Trade Mark). Recorded material is transferred on thetape 126 to a VTR 204 which is connected to a B-Box 178. Both the A-Boxand the B-Box are linked by communication links 174 and 180 to adatabase processor 176 storing a metadata base. Examples of metadatastored in the database are given in the section Metadata below. Themetadata relates to the material recorded on the tape 126. Metadata maybe recorded on the tape as described herein below with reference toFIGS. 32 to 41.

In the present embodiment of the invention, the metadata is linked tothe material by UMIDs, which are described in the section UMIDs belowand, in accordance with the invention, by at least MURNs—MaterialReference Numbers. UMIDs which are universally unique identifiers have23, 32 or 64 bytes as described below. MURNs have 2 bytes in currentlypreferred embodiments of the invention and are intended to uniquelyidentify each piece of material on the tape but not be universallyunique. Together with the time code bits, the MURNs identify each clipon the tape to frame accuracy.

In the embodiment of FIG. 1, a tape ID is recorded in the datastore 502.Also the tape ID may be applied to the tape. The tape ID may be recordedon the tape during a prestriping process.

MURNs are generated as the material is recorded on the tape. The MURNsare preferably recorded in the user bits of tape time codes. That may bedone in a similar way as recording tape IDs in time code as describedbelow as described with reference to FIGS. 28, 29 and 31. Preferably, atleast the last recorded one of the MURNs is also recorded in the datastore 502. All the MURNs may be recorded in the data store 502 but thatis not essential in this embodiment.

The camcorder 500 has a machine ID which is for example the serialnumber of the camcorder. Preferably in this embodiment the machine ID isrecorded in the data store 502. It may be recorded on the tape if thereis sufficient space.

The tape 126, after material is recorded on it, is transferred to a VTR204 which is coupled to a B-Box 178. The VTR 204 and B-Box 178 togetherread the MURNs, the Tape ID, and the machine ID, and the B-Boxassociates a UMID with each MURN. In this embodiment, the UMID isretrieved from the database 176. Each UMID is associated with acombination of the Tape ID and a MURN and the associated identifiers(UMID, Tape ID and MURN) are stored in the database 176. For thatpurpose the B-Box is coupled to the database processor via acommunications link which may be a network link, e.g. an internet link.

In the embodiment of FIG. 1, the A-Box 152 generates, for each MURN, aUMID and transfers to the database 176, via a communications link 174,the UMIDs, MURNs, tape ID and any metadata which is generated at theA-Box. Thus the UMID is available in the database to be retrieved by theB-Box 178.

Good Shot Markers (GSMs) and/or other metadata may be stored on thetape.

Referring to FIG. 2, the embodiment of FIG. 2 differs from that of FIG.1 in that the A-Box 152 is omitted, MURNs, Tape ID and Machine ID aregenerated by the camcorder without the A-Box and UMIDs are generated bythe B-Box instead of being retrieved from the database 176. Preferably,the Tape ID and machine ID are recorded in the datastore 502 and theMURNs are recorded in the user bits of time codes on the tape. Howeverthey may be recorded in the other ways described with reference to theembodiment of FIG. 1. The UMIDs are synthesised in the B-Box usingmachine ID, Tape ID and MURNs.

Good Shot Markers (GSMs) and/or other metadata may be stored on thetape.

Referring to FIG. 3, the embodiment of FIG. 3 differs from that of FIG.1

in that: the A-Box 152 is omitted; MURNs are generated by the camcorderwithout the A-Box; Tape ID is manually entered at the B-Box; machine IDidentifying the camcorder is not used; the tape has no datastore; andUMIDs are generated by the B-Box instead of being retrieved from thedatabase 176. The MURNs are recorded in the user bits of time codes onthe tape. The UMIDs are synthesised in the B-Box using the MURNs, amachine ID identifying the VTR 204 and the manually entered Tape ID.Preferably the Tape ID is manually written on a physical label on thecassette. The UMIDs and associated MURNs and Tape ID are transferred tothe database 176 via the link 180. It is possible that MURNs areduplicated on a tape in the absence of the datastore: that may bedetected by the database 176.

Good Shot Markers (GSMs) and/or other metadata may be stored on thetape.

Referring to FIG. 4, the embodiment of FIG. 4 differs from that of FIG.1 in that: the tape has no datastore 176; the Tape ID is manuallyentered at the B-Box; machine ID identifying the camcorder 500 is notused; and UMIDs are generated by the B-Box instead of being retrievedfrom the database 176. The MURNs are recorded in the user bits of timecodes on the tape. The UMIDs are synthesised in the B-Box using theMURNs, and the manually entered Tape ID. The Tape ID is manually writtenon a physical label on the cassette. The UMIDs and associated MURNs andTape ID are transferred to the database 176 via the link 180. It ispossible that MURNs are duplicated on a tape in the absence of thedatastore: that may be detected by the database 176.

In the embodiment of FIG. 4 the A-Box 152 associates with each MURN, anymetadata which is generated at the A-Box and the tape ID, which isentered at the A-Box and also written on a label on the tape 126. Thusthe database can associate the metadata, MURNs and tape ID transferredto it from the A-Box with the UMIDs and associated MURNs and tape IDfrom the B-Box.

The B-Box may also use the machine ID of the camcorder 500 inassociation with the UMIDs, and the A-Box may transfer the machine ID ofthe camcorder if that ID is recorded on the tape or written on the labelof the tape.

The Tape ID may be recorded on the tape and thus it may be detected bythe VTR 204 and B-Box avoiding the need to manually enter it at theB-Box.

Good Shot Markers (GSMs) and/or other metadata may be stored on thetape.

Referring to FIG. 5, there is shown a UMID which is further described inthe section UMIDs below. The UMID may be generated based on the data inthe datastore 502 or recorded on the tape, such data including at leastthe MURNs and preferably also the Tape ID and most preferably also themachine ID. FIG. 5 assumes that data is stored in the datastore,(Telefile) 502. When generating the UMID, the UMID type value, byte 11of the universal label, is set, the default being 04H, i.e. group; thecreation type, byte 12 of the universal label, is set locally atingestion; and the material number is set, it comprising 8 bytes of timesnap set at ingest, 2 bytes of random Number (Rnd) and 6 bytes ofMachine Node which is made up of data values passed through thedatastore 502.

Referring to FIGS. 6 to 8, an example of the operation of the system ofFIG. 1 will be given. In FIGS. 6 to 8 “tape label” denotes the datastore502, e.g. a Telefile. It is assumed the Tape ID is stored in at leastthe datastore 502. Referring to FIG. 6A, a blank tape is inserted in tothe camcorder 500. The datastore stores the Tape ID and a number 0indicating the number of tape erasures.

Referring to FIG. 6B, assume 4 clips have been recorded on the tape bythe same camcorder 500, camcorder A. The clips have respective MURNs 1to 4 allocated to them and recorded in the user bits of time codes onthe tape. As shown in FIG. 6C, assume one more clip with MURN 5 isrecorded on the tape and then the tape is ejected from the camcorder500. The MURN 5 is stored in the datastore 502 and the machine ID (or ashortened proxy version thereof) is stored in the datastore 502. TheMURN 5 is recorded in the data store to allow the next correct MURNnumber to be generated when the tape is next used.

Referring to FIG. 7A assume the same tape is inserted into anothercamcorder B. The datastore 502 indicates the last recorded MURN is 5. Asshown in FIG. 7B, more clips are recorded on the tape, and the clips aregiven MURNs 6, 7 . . . . As shown in FIG. 7C, when the tape is ejectedfrom the camcorder B, the Machine ID B is recorded in the data storetogether with the last recorded MURN (7). The machine ID of camcorder Ais retained and the last MURN (5) recorded with machine A is retained.

Referring to FIG. 8A, assume the same tape is inserted into anothermachine C which is for example a VTR. The ID, C, of the machine isrecorded in the datastore 502. The tape is partially erased, in thisexample over clips 2 to 6. The erased zone is denoted by MURN 8 which isrecorded in the data store. An erasure number=1 is stored in data store502.

FIGS. 8B and C show alternatives for full erasure. In FIG. 8B, the fullyerased tape is given MURN 8 which is recorded in the datastore and theerasure number is stored as 1 in the datastore. In FIG. 8C, the fullyerased tape is given MURN 0 and the datastore is cleared except for theTape ID.

Those skilled in the art will be able to relate FIGS. 6 to 8 to FIGS. 2to 4.

FIGS. 9 to 11 illustrate editing rules. Every tape edit event generatesa new MURN. A MURN treats all tape content as a group, e.g. video plusaudio1 plus audio2 plus . . . audio n. Referring to FIG. 10, a tape has2 audio channels A1 and A2 and 1 video channel V. After editing, thevideo channel has a first clip (UMID 123) up to time t1, a second clip(UMID 124) from time t2 onwards and a mix of the two clips between timest1 and t2. Audio A1 has a first audio section (UMID 123 up to time t3and a second section from time t3 onwards. Audio A2 has a first sectionup to time t4 and a second section from t4 onwards. Thus there are thedifferent groups of audio and video and thus the different MURNsindicated in the database 176 in FIG. 10. The UMIDs are generated fromthe MURNs and Tape ID.

FIG. 11 shows an example of inserting a section of audio, e.g. avoice-over (VO) into an audio channel. The voice over is identified byUMID #0002 in the audio channel A3. Group UMIDs #01 Gp, #02 Gp identifygrouped material. Thus Group UMID #02 is associated with: UMID #0001 inrespect of the video channel V, and audio channels A1, A2, and A4; theIN and OUT time codes (TC) of the voice over; and the tape ID.

UMIDs—FIGS. 12 and 13

UMIDs

A UMID is described in SMPTE Journal March 2000. Referring to FIG. 12 anextended UMID is shown. It comprises a first set of 32 bytes of basicUMID and a second set of 32 bytes of signature metadata.

The first set of 32 bytes is the basic UMID. The components are:

-   -   A 12-byte Universal Label to identify this as a SMPTE UMID. It        defines the type of material which the UMID identifies and also        defines the methods by which the globally unique Material and        locally unique Instance numbers are created.    -   A 1-byte length value to define the length of the remaining part        of the UMID.    -   A 3-byte Instance number which is used to distinguish between        different ‘instances’ of material with the same Material number.    -   A 16-byte Material number which is used to identify each clip.        Each Material number is the same for related instances of the        same material.

The second set of 32 bytes of the signature metadata as a set of packedmetadata items used to create an extended UMID. The extended UMIDcomprises the basic UMID followed immediately by signature metadatawhich comprises:

-   -   An 8-byte time/date code identifying the time and date of the        Content Unit creation.    -   A 12-byte value which defines the spatial co-ordinates at the        time of Content Unit creation.    -   3 groups of 4-byte codes which register the country,        organisation and user codes

Each component of the basic and extended UMIDs will now be defined inturn.

The 12-Byte Universal Label

The first 12 bytes of the UMID provide identification of the UMID by theregistered string value defined in table 1. TABLE 1 Specification of theUMID Universal Label Byte No. Description Value (hex) 1 ObjectIdentifier 06h 2 Label size 0Ch 3 Designation: ISO 2Bh 4 Designation:SMPTE 34h 5 Registry: Dictionaries 01h 6 Registry: Metadata Dictionaries01h 7 Standard: Dictionary Number 01h 8 Version number 01h 9 Class:Identification and location 01h 10 Sub-class: Globally UniqueIdentifiers 01h 11 Type: UMID (Picture, Audio, Data, Group) 01, 02, 03,04h 12 Type: Number creation method XXh

The hex values in table 1 may be changed: the values given are examples.Also the bytes 1-12 may have designations other than those shown by wayof example in the table. Referring to the Table 1, in the example shownbyte 4 indicates that bytes 5-12 relate to a data format agreed bySMPTE. Byte 5 indicates that bytes 6 to 10 relate to “dictionary” data.Byte 6 indicates that such data is “metadata” defined by bytes 7 to 10.Byte 7 indicates the part of the dictionary containing metadata definedby bytes 9 and 10. Byte 10 indicates the version of the dictionary. Byte9 indicates the class of data and Byte 10 indicates a particular item inthe class.

In the present embodiment bytes 1 to 10 have fixed preassigned values.Byte 11 is variable. Thus referring to FIG. 13, and to Table 1 above, itwill be noted that the bytes 1 to 10 of the label of the UMID are fixed.Therefore they may be replaced by a 1 byte ‘Type’ code T representingthe bytes 1 to 10. The type code T is followed by a length code L. Thatis followed by 2 bytes, one of which is byte 11 of Table 1 and the otherof which is byte 12 of Table 1, an instance number (3 bytes) and amaterial number (16 bytes). Optionally the material number may befollowed by the signature metadata of the extended UMID and/or othermetadata.

The UMID type (byte 11) has 4 separate values to identify each of 4different data types as follows:

‘01h’=UMID for Picture material

‘02h’=UMID for Audio material

‘03h’=UMID for Data material

‘04h’=UMID for Group material (i.e. a combination of related essence).

The last (12th) byte of the 12 byte label identifies the methods bywhich the material and instance numbers are created. This byte isdivided into top and bottom nibbles where the top nibble defines themethod of Material number creation and the bottom nibble defines themethod of Instance number creation.

Length

The Length is a 1-byte number with the value ‘13h’ for basic UMIDs and‘33h’ for extended UMIDs.

Instance Number

The Instance number is a unique 3-byte number which is created by one ofseveral means defined by the standard. It provides the link between aparticular ‘instance’ of a clip and externally associated metadata.Without this instance number, all material could be linked to anyinstance of the material and its associated metadata.

The creation of a new clip requires the creation of a new Materialnumber together with a zero Instance number. Therefore, a non-zeroInstance number indicates that the associated clip is not the sourcematerial. An Instance number is primarily used to identify associatedmetadata related to any particular instance of a clip.

Material Number

The 16-byte Material number is a non-zero number created by one ofseveral means identified in the standard. The number is dependent on a6-byte registered port ID number, time and a random number generator.

Signature Metadata

Any component from the signature metadata may be null-filled where nomeaningful value can be entered. Any null-filled component is whollynull-filled to clearly indicate a downstream decoder that the componentis not valid.

The Time-Date Format

The date-time format is 8 bytes where the first 4 bytes are a UTC(Universal Time Code) based time component. The time is defined eitherby an AES3 32-bit audio sample clock or SMPTE 12M depending on theessence type.

The second 4 bytes define the date based on the Modified Julian Data(MJD) as defined in SMPTE 309M. This counts up to 999,999 days aftermidnight on the 17 Nov. 1858 and allows dates to the year 4597.

The Spatial Co-ordinate Format

The spatial co-ordinate value consists of three components defined asfollows:

-   -   Altitude: 8 decimal numbers specifying up to 99,999,999 metres.    -   Longitude: 8 decimal numbers specifying East/West 180.00000        degrees (5 decimal places active).    -   Latitude: 8 decimal numbers specifying North/South 90.00000        degrees (5 decimal places active).

The Altitude value is expressed as a value in metres from the centre ofthe earth thus allowing altitudes below the sea level.

It should be noted that although spatial co-ordinates are static formost clips, this is not true for all cases. Material captured from amoving source such as a camera mounted on a vehicle may show changingspatial co-ordinate values.

Country Code

The Country code is an abbreviated 4-byte alpha-numeric string accordingto the set defined in ISO 3166. Countries which are not registered canobtain a registered alpha-numeric string from the SMPTE RegistrationAuthority.

Organisation Code

The Organisation code is an abbreviated 4-byte alpha-numeric stringregistered with SMPTE. Organisation codes have meaning only in relationto their registered Country code so that Organisation codes can have thesame value in different countries.

User Code

The User code is a 4-byte alpha-numeric string assigned locally by eachorganisation and is not globally registered. User codes are defined inrelation to their registered Organisation and Country codes so that Usercodes may have the same value in different organisations and countries.

Freelance Operators

Freelance operators may use their country of domicile for the countrycode and use the Organisation and User codes concatenated to e.g. an 8byte code which can be registered with SMPTE. These freelance codes maystart with the ‘˜’ symbol (ISO 8859 character number 7Eh) and followedby a registered 7 digit alphanumeric string.

Metadata—FIG. 14

The following is provided, by way of example, to illustrate the possibletypes of metadata generated during the production of a programme, andone possible organisational approach to structuring that metadata in adata base. UMIDs link the metadata in the database to the material onthe tape.

FIG. 14 illustrates an example structure for organising metadata. Anumber of tables each comprising a number of fields containing metadataare provided. The tables may be associated with each other by way ofcommon fields within the respective tables, thereby providing arelational structure. Also, the structure may comprise a number ofinstances of the same table to represent multiple instances of theobject that the table may represent. The fields may be formatted in apredetermined manner. The size of the fields may also be predetermined.Example sizes include “Int” which represents 2 bytes, “Long Int” whichrepresents 4 bytes and “Double” which represents 8 bytes. Alternatively,the size of the fields may be defined with reference to the number ofcharacters to be held within the field such as, for example, 8, 10, 16,32, 128, and 255 characters.

Turning to the structure in more detail, there is provided a ProgrammeTable. The Programme Table comprises a number of fields includingProgramme ID (PID), Title, Working Title, Genre ID, Synopsis, AspectRatio, Director ID and Picturestamp. Associated with the Programme Tableis a Genre Table, a Keywords Table, a Script Table, a People Table, aSchedule Table and a plurality of Media Object Tables.

The Genre Table comprises a number of fields including Genre ID, whichis associated with the Genre ID field of the Programme Table, and GenreDescription.

The Keywords Table comprises a number of fields including Programme ID,which is associated with the Programme ID field of the Programme Table,Keyword ID and Keyword.

The Script Table comprises a number of fields including Script ID,Script Name, Script Type, Document Format, Path, Creation Date, OriginalAuthor, Version, Last Modified, Modified By, PID associated withProgramme ID and Notes. The People Table comprises a number of fieldsincluding Image.

The People Table is associated with a number of Individual Tables and anumber of Group Tables. Each Individual Table comprises a number offields including Image. Each Group Table comprises a number of fieldsincluding Image. Each Individual Table is associated with either aProduction Staff Table or a Cast Table.

The Production Staff Table comprises a number of fields includingProduction Staff ID, Surname, Firstname, Contract ID, Agent, Agency ID,E-mail, Address, Phone Number, Role ID, Notes, Allergies, DOB, NationalInsurance Number and Bank ID and Picture Stamp.

The Cast Table comprises a number of fields including Cast ID, Surname,Firstname, Character Name, Contract ID, Agent, Agency ID, Equity Number,E-mail, Address, Phone Number, DOB and Bank ID and Picture Stamp.Associated with the Production Staff Table and Cast Table are a BankDetails Table and an Agency Table.

The Bank Details Table comprises a number of fields including Bank ID,which is associated with the Bank ID field of the Production Staff Tableand the Bank ID field of the Cast Table, Sort Code, Account Number andAccount Name.

The Agency Table comprises a number of fields including Agency ID, whichis associated with the Agency ID field of the Production Staff Table andthe Agency ID field of the Cast Table, Name, Address, Phone Number, WebSite and E-mail and a Picture Stamp. Also associated with the ProductionStaff Table is a Role Table.

The Role Table comprises a number of fields including Role ID, which isassociated with the Role ID field of the Production Staff Table,Function and Notes and a Picture Stamp. Each Group Table is associatedwith an Organisation Table.

The Organisation Table comprises a number of fields includingOrganisation ID, Name, Type, Address, Contract ID, Contact Name, ContactPhone Number and Web Site and a Picture Stamp.

Each Media Object Table comprises a number of fields including MediaObject ID, Name, Description, Picturestamp, PID, Format, schedule ID,script ID and Master ID. Associated with each Media Object Table is thePeople Table, a Master Table, a Schedule Table, a Storyboard Table, ascript table and a number of Shot Tables.

The Master Table comprises a number of fields including Master ID, whichis associated with the Master ID field of the Media Object Table, Title,Basic UMID, EDL ID, Tape ID and Duration and a Picture Stamp.

The Schedule Table comprises a number of fields including Schedule ID,Schedule Name, Document Format, Path, Creation Date, Original Author,Start Date, End Date, Version, Last Modified, Modified By and Notes andPID which is associated with the programme ID.

The contract table contains: a contract ID which is associated with thecontract ID of the Production staff, cast, and organisation tables;commencement date, rate, job title, expiry date and details.

The Storyboard Table comprises a number of fields including StoryboardID, which is associated with the Storyboard ID of the shot Table,Description, Author, Path and Media ID.

Each Shot Table comprises a number of fields including Shot ID, PID,Media ID, Title, Location ID, Notes, Picturestamp, script ID, scheduleID, and description. Associated with each Shot Table is the PeopleTable, the Schedule Table, script table, a Location Table and a numberof Take Tables.

The Location Table comprises a number of fields including Location ID,which is associated with the Location ID field of the Shot Table, GPS,Address, Description, Name, Cost Per Hour, Directions, Contact Name,Contact Address and Contact Phone Number and a Picture Stamp.

Each Take Table comprises a number of fields including Basic UMID, TakeNumber, Shot ID, Media ID, Timecode IN, Timecode OUT, Sign Metadata,Tape ID, Camera ID, Head Hours, Videographer, IN Stamp, OUT Stamp. LensID, AUTOID ingest ID and Notes. Associated with each Take Table is aTape Table, a Task Table, a Camera Table, a lens table, an ingest tableand a number of Take Annotation Tables.

The Ingest table contains an Ingest ID which is associated with theIngest Id in the take table and a description.

The Tape Table comprises a number of fields including Tape ID, which isassociated with the Tape ID field of the Take Table, PID, Format, MaxDuration, First Usage, Max Erasures, Current Erasure, ETA (estimatedtime of arrival) and Last Erasure Date and a Picture Stamp.

The Task Table comprises a number of fields including Task ID, PID,Media ID, Shot ID, which are associated with the Media ID and Shot IDfields respectively of the Take Table, Title, Task Notes, DistributionList and CC List. Associated with the Task Table is a Planned ShotTable.

The Planned Shot Table comprises a number of fields including PlannedShot ID, PID, Media ID, Shot ID, which are associated with the PID,Media ID and Shot ID respectively of the Task Table, Director, ShotTitle, Location, Notes, Description, Videographer, Due date, Programmetitle, media title Aspect Ratio and Format.

The Camera Table comprises a number of fields including Camera ID, whichis associated with the Camera ID field of the Take Table, Manufacturer,Model, Format, Serial Number, Head Hours, Lens ID, Notes, Contact Name,Contact Address and Contact Phone Number and a Picture Stamp.

The Lens Table comprises a number of fields including Lens ID, which isassociated with the Lens ID field of the Take Table, Manufacturer,Model, Serial Number, Contact Name, Contact Address and Contact PhoneNumber and a Picture Stamp.

Each Take Annotation Table comprises a number of fields including TakeAnnotation ID, Basic UMID, Timecode, Shutter Speed, Iris, Zoom, Gamma,Shot Marker ID, Filter Wheel, Detail and Gain. Associated with each TakeAnnotation Table is a Shot Marker Table.

The Shot Marker Table comprises a number of fields including Shot MarkerID, which is associated with the Shot Marker ID of the Take AnnotationTable, and Description.

A Box—FIGS. 15 to 21

Embodiments of the present invention relate to audio and/or videogeneration apparatus which may be for example television cameras, videocameras or camcorders. An embodiment of the present invention will nowbe described with reference to FIG. 15 which provides a schematic blockdiagram of a video camera which is arranged to communicate to a personaldigital assistant (PDA). A PDA is an example of a data processor whichmay be arranged in operation to generate metadata in accordance with auser's requirements. The term personal digital assistant is known tothose acquainted with the technical field of consumer electronics as aportable or hand held personal organiser or data processor which includean alpha numeric key pad and a hand writing interface.

In FIG. 15 a video camera 101 is shown to comprise a camera body 102which is arranged to receive light from an image source falling within afield of view of an imaging arrangement 104 which may include one ormore imaging lenses (not shown). The camera also includes a view finder106 and an operating control unit 108 from which a user can control therecording of signals representative of the images formed within thefield of view of the camera. The camera 101 also includes a microphone110 which may be a plurality of microphones arranged to record sound instereo. Also shown in FIG. 15 is a hand-held PDA 112 which has a screen114 and an alphanumeric key pad 116 which also includes a portion toallow the user to write characters recognised by the PDA. The PDA 112 isarranged to be connected to the video camera 101 via an interface 118.The interface 118 is arranged in accordance with a predeterminedstandard format such as, for example an RS232 or the like. The interface118 may also be effected using infra-red signals, whereby the interface118 is a wireless communications link. The interface 118 provides afacility for communicating information with the video camera 101. Thefunction and purpose of the PDA 112 will be explained in more detailshortly. However in general the PDA 112 provides a facility for sendingand receiving metadata generated using the PDA 112 and which can berecorded with the audio and video signals detected and captured by thevideo camera 1. A better understanding of the operation of the videocamera 101 in combination with the PDA 112 may be gathered from FIG. 16which shows a more detailed representation of the body 102 of the videocamera which is shown in FIG. 15 and in which common parts have the samenumerical designations.

In FIG. 16 the camera body 102 is shown to comprise a tape drive 122having read/write heads 124 operatively associated with a magneticrecording tape 126. Also shown in FIG. 16 the camera body includes ametadata generation processor 128 coupled to the tape drive 122 via aconnecting channel 130. Also connected to the metadata generationprocessor 128 is a data store 132, a clock 136 and three sensors 138,140, 142. The interface unit 118 sends and receives data also shown inFIG. 16 via a wireless channel 119. Correspondingly two connectingchannels for receiving and transmitting data respectively, connect theinterface unit 118 to the metadata generation processor 128 viacorresponding connecting channels 148 and 150. The metadata generationprocessor is also shown to receive via a connecting channel 151 theaudio/video signals generated by the camera. The audio/video signals arealso fed to the tape drive 122 to be recorded on to the tape 126.

The video camera 110 shown in FIG. 15 operates to record visualinformation falling within the field of view of the lens arrangement 104onto a recording medium. The visual information is converted by thecamera into video signals. In combination, the visual images arerecorded as video signals with accompanying sound which is detected bythe microphone 101 and arranged to be recorded as audio signals on therecording medium with the video signals. As shown in FIG. 16, therecording medium is a magnetic tape 126 which is arranged to record theaudio and video signals onto the recording tape 126 by the read/writeheads 124. The arrangement by which the video signals and the audiosignals are recorded by the read/write heads 124 onto the magnetic tape126 is not shown in FIG. 16 and will not be further described as thisdoes not provide any greater illustration of the example embodiment ofthe present invention. However once a user has captured visual imagesand recorded these images using the magnetic tape 126 as with theaccompanying audio signals, metadata describing the content of theaudio/video signals may be input using the PDA 112. As will be explainedshortly this metadata can be information that identifies the audio/videosignals in association with a pre-planned event, such as a ‘take’. Asshown in FIG. 16 the interface unit 118 provides a facility whereby themetadata added by the user using the PDA 112 may be received within thecamera body 102. Data signals may be received via the wireless channel119 at the interface unit 118. The interface unit 118 serves to convertthese signals into a form in which they can be processed by theacquisition processor 128 which receives these data signals via theconnecting channels 148, 150.

Metadata is generated automatically by the metadata generation processor128 in association with the audio/video signals which are received viathe connecting channel 151. In the example embodiment illustrated inFIG. 16, the metadata generation processor 128 operates to generate timecodes with reference to the clock 136, and to write these time codes onto the tape 126 in a linear recording track provided for this purpose.The time codes are formed by the metadata generation processor 128 fromthe clock 136. Furthermore, the metadata generation processor 128 formsother metadata automatically such as a UMID, which identifies uniquelythe audio/video signals. The metadata generation processor may operatein combination with the tape driver 124, to write the UMID on to thetape with the audio/video signals.

In an alternative embodiment, the UMID, as well as other metadata may bestored in the data store 132 and communicated separately from the tape126. In this case, a tape ID is generated by the metadata generationprocessor 128 and written on to the tape 126, to identify the tape 126from other tapes.

In order to generate the UMID, and other metadata identifying thecontents of the audio/video signals, the metadata generation processor128 is arranged in operation to receive signals from other sensor 138,140, 142, as well as the clock 136. The metadata generation processortherefore operates to co-ordinate these signals and provides themetadata generation processor with metadata such as the aperture settingof the camera lens 104, the shutter speed and a signal received via thecontrol unit 108 to indicate that the visual images captured are a “goodshot”. These signals and data are generated by the sensors 138, 140, 142and received at the metadata generation processor 128. The metadatageneration processor in the example embodiment is arranged to producesyntactic metadata which provides operating parameters which are used bythe camera in generating the video signals. Furthermore the metadatageneration processor 128 monitors the status of the camcorder 101, andin particular whether audio/video signals are being recorded by the tapedrive 124. When RECORD START is detected the IN POINT time code iscaptured and a UMID is generated in correspondence with the IN POINTtime code. Furthermore in some embodiments an extended UMID isgenerated, in which case the metadata generation processor is arrangedto receive spatial co-ordinates which are representative of the locationat which the audio/video signals are acquired. The spatial co-ordinatesmay be generated by a receiver which operates in accordance with theGlobal Positioning System (GPS). The receiver may be external to thecamera, or may be embodied within the camera body 102.

When RECORD START is detected, the OUT POINT time code is captured bythe metadata generation processor 128. As explained above, it ispossible to generate a “good shot” marker. The “good shot” marker isgenerated during the recording process, and detected by the metadatageneration processor. The “good shot” marker is then either stored onthe tape, or within the data store 132, with the corresponding IN POINTand OUT POINT time codes.

As already indicated above, the PDA 112 is used to facilitateidentification of the audio/video material generated by the camera. Tothis end, the PDA is arranged to associate this audio/video materialwith pre-planned events such as scenes, shots or takes. The camera andPDA shown in FIGS. 15 and 16 form part of an integrated system forplanning, acquiring, editing an audio/video production. During aplanning phase, the scenes which are required in order to produce anaudio/video production are identified. Furthermore for each scene anumber of shots are identified which are required in order to establishthe scene. Within each shot, a number of takes may be generated and fromthese takes a selected number may be used to form the shot for the finaledit. The planning information in this form is therefore identified at aplanning stage. Data representing or identifying each of the plannedscenes and shots is therefore loaded into the PDA 112 along with noteswhich will assist the director when the audio/video material iscaptured. An example of such data is shown in the table below. A/VProduction News story: BMW disposes of Rover Scene ID: 900015689 OutsideLongbridge Shot 5000000199 Longbridge BMW Sign Shot 5000000200 WorkersLeaving shift Shot 5000000201 Workers in car park Scene ID: 900015690BMW HQ Munich Shot 5000000202 Press conference Shot 5000000203 OutsideBMW building Scene ID: 900015691 Interview with minister Shot 5000000204Interview

In the first column of the table below the event which will be capturedby the camera and for which audio/video material will be generated isshown. Each of the events which is defined in a hierarchy is providedwith an identification number. Correspondingly, in the second columnnotes are provided in order to direct or remind the director of thecontent of the planned shot or scene. For example, in the first row theaudio/video production is identified as being a news story, reportingthe disposal of Rover by BMW. In the extract of the planning informationshown in the table below, there are three scenes, each of which isprovided with a unique identification number. Each of these scenes are“Outside Long Bridge”, “BMW HQ Munich” and “Interview with Minister”.Correspondingly for each scene a number of shots are identified andthese are shown below each of the scenes with a unique shotidentification number. Notes corresponding to the content of each ofthese shots are also entered in the second column. So, for example, forthe first scene “Outside Long Bridge”, three shots are identified whichare “Long Bridge BMW”, “Workers leaving shift” and “Workers in carpark”. With this information loaded onto the PDA, the director or indeeda single camera man may take the PDA out to the place where the newstory is to be shot, so that the planned audio/video material can begathered. An illustration of the form of the PDA with the graphical userinterface displaying this information is shown in FIG. 17.

As indicated in FIG. 15, the PDA 112 is arranged to communicate data tothe camera 111. To this end the metadata generation processor 128 isarranged to communicate data with the PDA 112 via the interface 118. Theinterface 118 maybe for example an infra-red link 119 providing wirelesscommunications in accordance with a known standard. The PDA and theparts of the camera associated with generating metadata which are shownin FIG. 16 are shown in more detail in 18.

In 18 the parts of the camera which are associated with generatingmetadata and communicating with the PDA 112 are shown in a separateacquisition unit 152. However it will be appreciated that theacquisition unit 152 could also be embodied within the camera 102. Theacquisition unit 152 comprises the metadata generation processor 128,and the data store 132. The acquisition processor 152 also includes theclock 136 and the sensors 138, 140, 142 although for clarity these arenot shown in FIG. 18. Alternatively, some or all of these features whichare shown in 16 will be embodied within the camera 102 and the signalswhich are required to define the metadata such as the time codes and theaudio/video signals themselves may be communicated via a communicationslink 153 which is coupled to an interface port 154. The metadatageneration processor 128 is therefore provided with access to the timecodes and the audio/video material as well as other parameters used ingenerating the audio/video material. Signals representing the time codesend parameters as well as the audio/video signals are received from theinterface port 154 via the interface channel 156. The acquisition unit152 is also provided with a screen (not shown) which is driven by ascreen driver 158. Also shown in FIG. 18 the acquisition unit isprovided with a communications processor 160 which is coupled to themetadata generation processor 128 via a connecting channel 162.Communications is effected by the communications processor 160 via aradio frequency communications channel using the antennae 164. Apictorial representation of the acquisition unit 152 is shown in 19.

The PDA 112 is also shown in FIG. 18. The PDA 112 is correspondinglyprovided with an infra-red communications port 165 for communicatingdata to and from the acquisition unit 152 via an infra-red link 119. Adata processor 166 within the PDA 112 is arranged to communicate data toand from the infra-red port 165 via a connecting channel 166. The PDA112 is also provided with a data store 167 and a screen driver 168 whichare connected to the data processor 166.

The pictorial representation of the PDA 112 shown in FIG. 17 and theacquisition unit shown in FIG. 19 provide an illustration of an exampleembodiment of the present invention. A schematic diagram illustratingthe arrangement and connection of the PDA 112 and the acquisition unit152 is shown in FIG. 20. In the example shown in FIG. 20 the acquisitionunit 152 is mounted on the back of a camera 101 and coupled to thecamera via a six pin remote connector and to a connecting channelconveying the external signal representative of the time code recordedonto the recording tape. Thus, the six pin remote connector and the timecode indicated as arrow lines form the communications channel 153 shownin FIG. 18. The interface port 154 is shown in FIG. 20 to be a VA to DN1conversion comprising an RM-P9/LTC to RS422 converter 154. RM-P9 is acamera remote control protocol, whereas LTC is Linear Time Code in theform of an analogue signal. This is arranged to communicate with a RS422to RS232 converter 154″ via a connecting channel which forms part of theinterface port 154. The converter 154″ then communicates with themetadata generation processor 128 via the connecting channel 156 whichoperates in accordance with the RS 232 standard.

Returning to FIG. 18, the PDA 112 which has been loaded with thepre-planned production information is arranged to communicate thecurrent scene and shot for which audio/video material is to be generatedby communicating the next shot ID number via the infra-red link 119. Thepre-planned information may also have been communicated to theacquisition unit 152 and stored in the data store 132 via a separatelink or via the infra-red communication link 119. However in effect theacquisition unit 152 is directed to generate metadata in associationwith the scene or shot ID number which is currently being taken. Afterreceiving the information of the current shot the camera 102 is thenoperated to make a “take of the shot”. The audio/video material of thetake is recorded onto the recording tape 126 with corresponding timecodes. These time codes are received along with the audio/video materialvia the interface port 154 at the metadata generation processor 128. Themetadata generation processor 128 having been informed of the currentpre-planned shot now being taken logs the time codes for each take ofthe shot. The metadata generation processor therefore logs the IN andOUT time codes of each take and stores these in the data store 132.

The information generated and logged by the metadata generationprocessor 128 is shown in the table below. In the first column the sceneand shot are identified with the corresponding ID numbers, and for eachshot several takes are made by the camera operator which are indicatedin a hierarchical fashion. Thus, having received information from thePDA 112 of the current shot, each take made by the camera operator islogged by the metadata generation processor 128 and the IN and OUTpoints for this take are shown in the second and third columns andstored in the data store 132. This information may also be displayed onthe screen of the acquisition unit 152 as shown in FIG. 19. Furthermore,the metadata generation processor 128 as already explained generates theUMID for each take for the audio/video material generated during thetake. The UMID for each take forms the fourth column of the table.Additionally, in some embodiments, to provide a unique identification ofthe tape once which the material is recorded, a tape identification isgenerated and associated with the metadata. The tape identification maybe written on to the tape, or stored on a random access memory chipwhich is embodied within the video tape cassette body. This randomaccess memory chip is known as a TELEFILE® system which provides afacility for reading the tape ID number remotely. The tape ID is writtenonto the magnetic tape 126 to uniquely identify this tape. In preferredembodiments the TELEFILE® system is provided with a unique number whichmanufactured as part of the memory and so can be used as the tape IDnumber. In other embodiments the TELEFILE® system provides automaticallythe IN/OUT time codes of the recorded audio/video material items.

In one embodiment the information shown in the table below is arrangedto be recorded onto the magnetic tape in a separate recording channel.However, in other embodiments the metadata shown in the table iscommunicated separately from the tape 126 using either thecommunications processor 160 or the infra-red link 119. The metadatamaybe received by the PDA 112 for analysis and may be furthercommunicated by the PDA. Scene ID: 900015689 Tape ID: 00001 UMID: Shot5000000199 Take 1 IN: 00:03:45:29 OUT: 00:04:21:05 060C23B340 . . . Take2 IN: 00:04:21:20 OUT: 00:04:28:15 060C23B340 . . . Take 3 IN:00:04:28:20 OUT: 00:05:44:05 060C23B340 . . . Shot 5000000200 Take 1 IN:00:05:44:10 OUT: 00:08:22:05 060C23B340 . . . Take 2 IN: 00:08:22:10OUT: 00:08:23:05 060C23B340 . . .

The communications processor 160 may be arranged in operation totransmit the metadata generated by the metadata generation processor 128via a wireless communications link. The metadata maybe received via thewireless communications link by a remotely located studio which can thenacquire the metadata and process this metadata ahead of the audio/videomaterial recorded onto the magnetic tape 126. This provides an advantagein improving the rate at which the audio/video production may begenerated during the post production phase in which the material isedited.

A further advantageous feature provided by embodiments of the presentinvention is an arrangement in which a picture stamp is generated atcertain temporal positions within the recorded audio/video signals. Apicture stamp is known to those skilled in the art as being a digitalrepresentation of an image and in the present example embodiment isgenerated from the moving video material generated by the camera. Thepicture stamp may be of lower quality in order to reduce an amount ofdata required to represent the image from the video signals. Thereforethe picture stamp may be compression encoded which may result in areduction in quality. However a picture stamp provides a visualindication of the content of the audio/video material and therefore is avaluable item of metadata. Thus, the picture stamp may for example begenerated at the IN and OUT time codes of a particular take. Thus, thepicture stamps may be associated with the metadata generated by themetadata generation processor 128 and stored in the data store 132. Thepicture stamps are therefore associated with items of metadata such as,for example, the time codes which identify the place on the tape wherethe image represented by the picture stamp is recorded. The picturestamps may be generated with the “Good Shot” markers. The picture stampsare generated by the metadata generation processor 128 from theaudio/video signals received via the communications link 153. Themetadata generation processor therefore operates to effect a datasampling and compression encoding process in order to produce thepicture stamps. Once the picture stamps have been generated they can beused for several purposes. They may be stored in a data file andcommunicated separately from the tape 126, or they may be stored on thetape 126 in compressed form in a separate recording channel.Alternatively in preferred embodiments picture stamps may becommunicated using the communications processor 160 to the remotelylocated studio where a producer may analysis the picture stamps. Thisprovides the producer with an indication as to whether the audio/videomaterial generated by the camera operator is in accordance with what isrequired.

In a yet further embodiment, the picture stamps are communicated to thePDA 112 and displayed on the PDA screen. This may be effected via theinfra-red port 119 or the PDA may be provided with a further wirelesslink which can communicate with the communications processor 160. Inthis way a director having the hand held PDA 112 is provided with anindication of the current audio/video content generated by the camera.This provides an immediate indication of the artist and aestheticquality of the audio/video material currently being generated. Asalready explained the picture stamps are compression encoded so thatthey may be rapidly communicated to the PDA.

A further advantage of the acquisition unit 152 shown in FIG. 19 is thatthe editing process is made more efficient by providing the editor at aremotely located studio with an indication of the content of theaudio/video material in advance of receiving that material. This isbecause the picture stamps are communication with the metadata via awireless link so that the editor is provided with an indication of thecontent of the audio/video material in advance of receiving theaudio/video material itself. In this way the bandwidth of theaudio/video material can remain high with a correspondingly high qualitywhilst the metadata and picture stamps are at a relatively low bandwidth providing relatively low quality information. As a result of thelow band width the metadata and picture stamps may be communicated via awireless link on a considerably lower band width channel. Thisfacilitates rapid communication of the metadata describing content ofthe audio/video material.

The picture stamps generated by the metadata generation processor 128can be at any point during the recorded audio/video material. In oneembodiment the picture stamps are generated at the IN and OUT points ofeach take. However in other embodiments of the present invention as anactivity processor 170 is arranged to detect relative activity withinthe video material. This is effected by performing a process in which ahistogram of the colour components of the images represented by thevideo signal is compiled and the rate of change of the colour componentsdetermined and changes in these colour components used to indicateactivity within the image. Alternatively or in addition, motion vectorswithin the image are used to indicate activity. The activity processor176 then operates to generate a signal indicative of the relativeactivity within the video material. The metadata generation processor128 then operates in response to the activity signal to generate picturestamps such more picture stamps are generated for greater activitywithin the images represented by the video signals.

In an alternative embodiment of the present invention the activityprocessor 170 is arranged to receive the audio signals via theconnecting channel 172 and to recognise speech within the audio signals.The activity processor 170 then generates content data representative ofthe content of this speech as text. The text data is then communicatedto the data processor 128 which may be stored in the data store 132 orcommunicated with other metadata via the communications processor 160 ina similar way to that already explained for the picture stamps.

FIG. 21 provides a schematic representation of a post production processin which the audio/video material is edited to produce an audio/videoprogram. As shown in FIG. 21 the metadata, which may include picturestamps and/or the speech content information is communicated from theacquisition unit 152 via a separate route represented by a broken line174, to a metadata database 176. The route 174 may be representative ofa wireless communications link formed by for example UMTS, GSM or thelike.

The database 176 stores metadata to be associated with the audio/videomaterial. The audio/video material in high quality form is recorded ontothe tape 126. Thus the tape 126 is transported back to the editing suitewhere it is ingested by an ingestion processor 178. The tapeidentification (tape ID) recorded onto the tape 126 or other metadataproviding an indication of the content of the audio/video material isused to associate the metadata stored in the data store 176 with theaudio/video material on the tape as indicated by the broken line 180.

As will be appreciated although the example embodiment of the presentinvention uses a video tape as the recording medium for storing theaudio/video signals, it will be understood that alternative recordingmedium such as magnetic disks and random access memories may also beused.

B Box—FIGS. 22 to 27

FIG. 22 provides a schematic representation of a post production processin which the audio/video material is edited to produce an audio/videoprogram. As shown in FIG. 7 the metadata, which may include picturestamps and/or the speech content information is communicated from theacquisition unit 152 via a separate route represented by a broken line174, to a metadata database 176. The route 174 may be representative ofa wireless communications link formed by for example UMTS, GSM or thelike.

The database 176 stores metadata to be associated with the audio/videomaterial. The audio/video material in high quality form is recorded ontothe tape 126. Thus the tape 126 is transported back to the editing suitewhere it is ingested by an ingestion processor 178. The tapeidentification (tape ID) recorded onto the tape 126 or other metadataproviding an indication of the content of the audio/video material isused to associate the metadata stored in the data store 176 with theaudio/video material on the tape as indicated by the broken line 180.

The ingestion processor 178 is also shown in 22 to be connected to anetwork formed from a communications channel represented by a connectingline 182. The connecting line 182 represents a communications channelfor communicating data to 10 items of equipment, which form aninter-connected network. To this end, these items of equipment areprovided with a network card which may operate in accordance with aknown access technique such as Ethernet, RS422 and the like.Furthermore, as will be explained shortly, the communications network182 may also provide data communications in accordance with the SerialDigital Interface (SDI) or the Serial Digital Transport Interface(SDTI).

Also shown connected to the communications network 182 is the metadatadatabase 176, and an audio/video server 190, into which the audio/videomaterial is ingested. Furthermore, editing terminals 184, 186 are alsoconnected to the communications channel 182 along with a digitalmulti-effects processor 188.

The communications network 182 provides access to the audio/videomaterial present on tapes, discs or other recording media which areloaded into the ingestion processor 178.

The metadata database 176 is arranged to receive metadata via the route174 describing the content of the audio/video material recorded on tothe recording media loaded into the ingestion processor 178.

As will be appreciated although in the example embodiment a video tapehas been used as the recording medium for storing the audio/videosignals, it will be understood that alternative recording media such asmagnetic disks and random access memories may also be used, and thatvideo tape is provided as an illustrative example only.

The editing terminals 184, 186 digital multi-effects processor 188 areprovided with access to the audio/video material recorded on to thetapes loaded into the ingestion processor 178 and the metadatadescribing this audio/video material stored in the metadata database 176via the communications network 182. The operation of the ingestionprocessor with 178 in combination with the metadata database 176 willnow be described in more detail.

FIG. 23 provides an example representation of the ingestion processor178. In FIG. 23 the ingestion processor 178 is shown to have a jogshuttle control 200 for navigating through the audio/video materialrecorded on the tapes loaded into video tape recorders/reproducersforming part of the ingestion processor 178. The ingestion processor 178also includes a display screen 202 which is arranged to display picturestamps which describe selected parts of the audio/video material. Thedisplay screen 202 also acts as a touch screen providing a user with thefacility for selecting the audio/video material by touch. The ingestionprocessor 178 is also arranged to display all types of metadata on thescreen 202 which includes script, camera type, lens types and UMIDs.

As shown in FIG. 24, the ingestion processor 178 may include a pluralityof video tape recorders/reproducers into which the video tapes ontowhich the audio/video material is recorded may be loaded in parallel. Inthe example shown in FIG. 24, the video tape recorders 204 are connectedto the ingestion processor 178 via an RS422 link and an SDI IN/OUT link.The ingestion processor 178 therefore represents a data processor whichcan access any of the video tape recorders 204 in order to reproduce theaudio/video material from the video tapes loaded into the video taperecorders. Furthermore, the ingestion processor 178 is provided with anetwork card in order to access the communications network 182. As willbe appreciated from FIG. 249 however, the communications channel 182 iscomprised of a relatively low band width data communications channel182′ and a high band width SDI channel 182″ for use in streaming videodata. Correspondingly, therefore the ingestion processor 178 isconnected to the video tape recorders 204 via an RS422 link in ordercommunicate requests for corresponding items of audio/video material.Having requested these items of audio/video material, the audio/videomaterial is communicated back to the ingestion processor 178 via an SDIcommunication link 206 for distribution via the SDI network. Therequests may for example include the UMID which uniquely identifies theaudio/video material item(s).

The operation of the ingestion processor in association with themetadata database 176 will now be explained with reference to FIG. 25.In FIG. 25 the metadata database 176 is shown to include a number ofitems of metadata 210 associated with a particular tape ID 212. As shownby the broken line headed arrow 214, the tape ID 212 identifies aparticular video tape 216, on which the audio/video materialcorresponding to the metadata 210 is recorded. In the example embodimentshown in FIG. 25 the tape ID 212 is written onto the video tape 218 inthe linear time code area 220. However it will be appreciated that inother embodiments, the tape ID could be written in other places such asthe vertical blanking portion. The video tape 216 is loaded into one ofthe video tape recorders 204 forming part of the ingestion processor178.

In operation one of the editing terminals 184 is arranged to access themetadata database 176 via the low band width communications channel 182′the editing terminal 184 is therefore provided with access to themetadata 210 describing the content of the audio/video material recordedonto the tape 216. The metadata 210 may include such as the copyrightowner “BSkyB”, the resolution of the picture and the format in which thevideo material is encoded, the name of the program, which is in thiscase “Grandstand”, and information such as the date, time and audience.Metadata may further include a note of the content of the audio/videomaterial.

Each of the items of audio/video material is associated with a UMID,which identifies the audio/video material. As such, the editing terminal184 can be used to identify and select from the metadata 210 the itemsof audio/video material which are required in order to produce aprogram. This material may be identified by the UMID associated with thematerial. In order to access the audio/video material to produce theprogram, the editing terminal 184 communicates a request for thismaterial via the low band width communications network 182. The requestincludes the UMID or the UMIDs identifying the audio/video materialitem(s). In response to the request for audio/video material receivedfrom the editing terminal 184, the ingestion processor 178 is arrangedto reproduce selectively these audio/video material items identified bythe UMID or UMIDs from the video tape recorder into which the videocassette 216 is loaded. This audio/video material is then streamed viathe SDI network 182″ back to the editing terminal 184 to be incorporatedinto the audio/video production being edited. The streamed audio/videomaterial is ingested into the audio/video server 190 from where theaudio/video can be stored and reproduced.

FIG. 26 provides an alternative arrangement in which the metadata 210 isrecorded onto a suitable recording medium with the audio/video material.For example the metadata 210 could be recorded in one of the audiotracks of the video tape 218′. Alternatively, the recording medium maybe an optical disc or magnetic disc allowing random access and providinga greater capacity for storing data. In this case the metadata 210 maybe stored with the audio/video material.

In a yet further arrangement, some or all of the metadata may berecorded onto the tape 216. This may be recorded, for example, into thelinear recording track of the tape 218. Some metadata related to themetadata recorded onto the tape may be conveyed separately and stored inthe database 176. A further step is required in order to ingest themetadata and to this end the ingestion processor 178 is arranged to readthe metadata from the recording medium 218′ and convey the metadata viathe communications network 182′ to the metadata database 176. Therefore,it will be appreciated that the metadata associated with the audio/videomaterial to be ingested by the ingestion processor 178 may be ingestedinto the database 176 via a separate medium or via the recording mediumon which the audio/video material is also recorded.

The metadata associated with the audio/video material may also includepicture stamps which represent low quality representations of the imagesat various points throughout the video material. These may be presentedat the touch screen 202 on the ingestion processor 178. Furthermorethese picture stamps may be conveyed via the network 182′ to the editingterminals 184, 186 or the effects processor 188 to provide an indicationof the content of the audio/video material. The editor is thereforeprovided with a pictorial representation for the audio/video materialand from this a selection of an audio/video material item may be made.Furthermore, the picture stamp may be stored in the database 176 as partof the metadata 210. The editor may therefore retrieve a selected itemfor the corresponding picture stamp using the UMID which is associatedwith the picture stamp.

In other embodiments of the invention, the recording medium may not havesufficient capacity to include picture stamps recorded with theaudio/video material. This is likely to be so if the recording medium isa video tape 216. It is particularly appropriate in this case, althoughnot exclusively so, to generate picture stamps before or duringingestion of the audio/video material.

Returning to FIG. 22, in other embodiments, the ingestion processor 178may include a pre-processing unit. The pre-processing unit embodiedwithin the ingestion processor 178 is arranged to receive theaudio/video material recorded onto the recording medium which, in thepresent example is a video tape 126. To this end, the pre-processingunit may be provided with a separate video recorder/reproducer or may becombined with the video tape recorder/reproducer which forms part of theingestion processor 178. The pre-processing unit generates picturestamps associated with the audio/video material. As explained above, thepicture stamps are used to provide a pictorial representation of thecontent of the audio/video material items. However in accordance with afurther embodiment of the present invention the pre-processing unitoperates to process the audio/video material and generate an activityindicator representative of relative activity within the content of theaudio/video material. This may be achieved for example using a processorwhich operates to generate an activity signal in accordance with ahistogram of colour components within the images represented by thevideo signal and to generate the activity signals in accordance with arate of change of the colour histogram components. The pre-processingunit then operates to generate a picture stamp at points throughout thevideo material where there are periods of activity indicated by theactivity signal. This is represented in FIG. 27. In FIG. 27A picturestamps 224 are shown to be generated along a line 226 which isrepresenting time within the video signal. As shown in FIG. 27A thepicture stamps 224 are generated at times along the time line 226 wherethe activity signal represented as arrows 228 indicates events ofactivity. This might be for example someone walking into and out of thefield of view of the camera where there is a great deal of motionrepresented by the video signal. To this end, the activity signal mayalso be generated using motion vectors which may be, for example, themotion vectors generated in accordance with the MPEG standard.

In other embodiments of the invention, the pre-processor may generatetextual information corresponding to speech present within the audiosignal forming part of the audio/video material items stored on the tape126. The textual information may be generated instead of the picturestamps or in addition to the picture stamps. In this case, text may begenerated for example for the first words of sentences and/or the firstactivity of a speaker. This is detected from the audio signals presenton the tape recording or forming part of the audio/video material. Thestart points where text is to be generated is represented along the timeline 226 as arrows 230. Alternatively the text could be generated at theend of sentences or indeed at other points of interest within thespeech.

At the detected start of the speech, a speech processor operates togenerate a textual representation of the content of the speech. To thisend, the time line 226 shown in FIG. 27B is shown to include the text232 corresponding to the content of the speech at the start of activityperiods of speech.

The picture stamps and textual representation of the speech activitygenerated by the pre-processor is communicated via the communicationschannel 182 to the metadata database 176 and stored. The picture stampsand text are stored in association with the UMID identifying thecorresponding items of audio/video material from which the picturestamps 224 and the textual information 232 were generated. Thistherefore provides a facility to an editor operating one of the editingterminals 184, 186 to analyse the content of the audio/video materialbefore it is ingested using the ingestion processor 178. As such thevideo tape 126 is loaded into the ingestion processor 178 and thereafterthe audio/video material can be accessed via the network communicationschannel 182. The editor is therefore provided with an indication, veryrapidly, of the content of the audio/video material and so may ingestonly those parts of the material, which are relevant to the particularmaterial items required by the editor. This has a particular advantagein improving the efficiency with which the editor may produce anaudio/video production.

In an alternative embodiment, the pre-processor may be a separate unitand may be provided with a screen on which the picture stamps and/ortext information are displayed, and a means such as, for example, atouch screen, to provide a facility for selecting the audio/videomaterial items to be ingested.

In a further embodiment of the invention, the ingestion processor 178generates metadata items such as UMIDs whilst the audio/video materialis being ingested. This may required because the acquisition unit in thecamera 152 is not arranged to generate UMIDs, but does generate a UniqueMaterial Reference Number (MURN). The MURN is generated for eachmaterial item, such as a take. The MURN is arranged to be considerablyshorter than a UMID and can therefore be accommodated within the lineartime code of a video tape, which is more difficult for UMIDs becausethese are larger. Alternatively the MURN may be written into a TELEFILE®label of the tape. The MURN provides a unique identification of theaudio/video material items present on the tape. The MURNs may becommunicated separately to the database 176 as indicated by the line174.

At the ingestion processor 178, the MURN for the material items arerecovered from the tape or the TELEFILE label. For each MURN, theingestion processor 178 operates to generate a UMID corresponding to theMURN. The UMIDs are then communicated with the MURN to the database 176,and are ingested into the database in association with the MURNs, whichmay be already present within the database 176.

Tape IDs in Time Code-FIGS. 28 to 30

Referring to FIG. 28, a tape format is shown schematically. Video andaudio information is recorded in helical tracks of which a set of, e.g.10 or 12, tracks records one field of video. The helical tracks includevertical interval time codes (VITC). The time codes may be duplicated ina linear time code track LTC, but the contents of the VITC and LTC maybe different. The tape may comprise at least one other linear track (notshown). In this illustrative description it is assumed that all video,audio and other information is recorded digitally. However, the videoand audio may be recorded as analogue information. The video and audioinformation may be compressed according to the MPEG 2 standard forexample.

The time codes are recorded once per video field. As schematically shownin FIG. 29, a known time code has 80 bits of which 16 are reserved forsynchronisation information, 32 for time code bits and 32 for userdefined bits, herein referred to as “user bits”. The user bits areinterleaved with the other bits in a typical time code; however theinvention is not limited to that.

Tape IDs and UMIDs

UMIDs are described in the section UMIDs. They are material identifierswhich are universally unique. In embodiments of the present inventionthey are used to bind material i.e. video and/or audio recorded on thetape to metadata which is stored in for example a database 464 as shownin 30.

Embodiments of the present invention record, on the tape, TapeIdentifiers (Tape IDs) having most preferably 64 bits. Tape IDs may haveother numbers of bits for example in the range 32 to 64 bits. Unlike aUMID which is universally unique, a Tape ID may not be universallyunique but is unique to at least an organisation such as a productioncompany. The Tape ID is recorded in the user bits of the linear timecode. If it has 64 bits it occupies two time codes. It thus refers toone frame of two video fields. In preferred embodiments the same tape IDis repeated every frame. Preferably, the tape is “prestriped” before useto record linear time codes for the fields.

The format of an illustrative Tape ID is any 4 byte hex number as set bythe user-bit set-up controls on the VTR or camcorder.

Linking to a UMID

The Tape ID may not be unique. In embodiments of the present invention,a Tape ID is linked to a UMID which uniquely identifies the materialrecorded on the tape. The UMID is used to link the material on the tapeto other metadata relating to the material. If only one piece ofmaterial is recorded on a tape, then only the Tape ID needs to be linkedto the UMID which uniquely identifies that one piece of material.However, in practice two or more pieces of material would be recorded.For example, the tape may contain two or more takes of the same shot:each take is one piece of material and has its own UMID. Thus to linkeach UMID to each piece of material, the Tape ID plus the IN (start) andOUT (end) time codes of the piece of material are used.

Linking to a Database

It is desirable to provide more detailed metadata relating to thematerial recorded on the tape. Examples of such metadata are describedin the section Camera Metadata. Thus metadata is stored in a database,the UMID linking the metadata to the material.

Illustrative System

Referring to FIG. 30, a digital video source, e.g. a camcorder 460 has amultiplexer 463 which in known manner inserts the Tape ID and the IN andOUT time codes onto a tape. The IN and OUT time codes are generated eachtime a record start and stop button 471 is operated. The tape ID isgenerated as follows: —

The camcorder records a contiguous set of time codes for all fields; thetape ID is fixed, recorded in the time code user bits and is preset bythe user bit controls. The camera also outputs audio A, video V onrespective outputs.

The camera has a signal processor termed herein the A-BOX which storestime code snap shots at the beginning and end of a recording, i.e. theIN and OUT points. The user bits form part of the time code and thus thetape ID is monitored by monitoring the user bits, whereby the tape Idsare stored with the IN and Out points. The A-box derives the user bitsof the time codes from the tape and transfers them to a data processorwhich in this example is a PDA (Personal Digital Assistant) 469. TheA-Box is described in more detail in the section A-BOX. It may deriveother metadata from the camera and/or material recorded on the tape andtransfer it to the PDA 469.

The PDA 469 links the Tape ID and the IN and OUT time codes of thepieces of material recorded on the tape to one or more UMIDs. The PDAhas a data entry device 468, for example a keyboard, to enter data andmay have, or be connected to, a GPS device 470 for producing the spatialco-ordinate data of an extended UMID. The PDA generates the UMID andassociates it with the Tape ID and the IN and OUT codes. The PDA 469transfers the UMIDs, Tape IDs, IN and Out points, and any other metadatagenerated at the camera and/or PDA, to a database 464.

The database 464 in this example comprises a data base program run on astandard personal computer (PC) or a lap-top computer having a keyboard467 for data entry, a display 465 and a systems unit 466.

The database 464 stores more extensive and detailed metadata, includingthe UMID(s), the tape IDs, the IN and OUT points and other metadatagenerated at the Camera 460, the PDA 469 and/or the data entry device467. The Tape IDs and the IN and OUT points on the tape and the UMID(s)in the database allow clear and unique linking of the material on thetape, and of the tape on which the material is recorded, to the data inthe database.

Metadata, which is additional to the UMID, may be entered into the PDA469 by the operator using the keyboard 468. A computer 461 in the PDAgenerates the UMID (whether basic or extended or having the data-reducedstructure as shown in FIG. 13 of the section UMIDs) and formats theother metadata into a suitable data structure for transfer to thedatabase 464.

Interconnecting the Camera, PDA and Database.

Data transfer between the A-box and PDA may be by corded or wirelesslink. For example the PDA may have in Infra Red port for the transfer ofdata linking with a corresponding Infra-Red port on the A-Box. Likewisethe PDA may be linked to the database by a corded or wireless link. Thelink from the PDA to the database may be via a telephone link, or bydirect radio link. The PDA may be linked to the database via theinternet.

Modifications.

The ‘A-BOX’ and the PDA 469 are shown as items separate from the camera460. The A-box may be replaced by a processor, e.g. a computer builtinto the camera. Alternatively both the a-Box and the PDA may bereplaced by a processor built into the camera.

Whilst the first aspect of the invention has been described by way ofexample with reference to tape, the invention may be applied to otherrecording media. For example tapes may be replaced by discs such asoptical or magneto-optical discs or by computer hard discs.

Modifications.

Although the foregoing description describes the embodiments of thefirst aspect of the invention in relation to video material, theinvention may be applied to audio material and/or to data.

Although the foregoing description describes the embodiments of theinvention in relation to material recorded on a recording medium, andthe MURNs are applied to recorded material, the MURNs may be embedded inmaterial from a live source and transmitted to a processor or to atransmission and distribution network as streamed and unrecordedmaterial.

Second Aspect of the Invention

MURNs in Time Code—FIGS. 28, 29 and 31

Embodiments of the second aspect of the invention will now be describedwith reference to:

FIG. 28 which schematically illustrates a known tape format;

FIG. 29 which schematically illustrates a time code; and

FIG. 31 which schematically illustrates a digital camcorder.

Referring to FIG. 28, a tape format is shown schematically. Video andaudio information is recorded in helical tracks of which a set of, e.g.10 or 12, tracks records one field of video. The helical tracks includevertical interval time codes (VITC). The time codes may be duplicated ina linear time code track LTC, but the contents of the VITC and LTC maybe different. The tape may comprise at least one other linear track (notshown). In this illustrative description it is assumed that all video,audio and other information is recorded digitally. However, the videoand audio may be recorded as analogue information. The video and audioinformation may be compressed according to the MPEG 2 standard forexample.

The time codes are recorded once per video field. As schematically shownin FIG. 29, a known time code has 80 bits of which 16 are reserved forsynchronisation information, 32 for time code bits and 32 for userdefined bits, herein referred to as “user bits”. The user bits areinterleaved with the other bits in a typical time code; however theinvention is not limited to that.

The present embodiment involves the recording of locally unique materialidentifiers, to be referred to as MURNs (material unique referencenumber) in the time code user bits.

Unlike, for example, an SMPTE UMID which is a globally unique referencefor a piece of material (but also takes up between 32 and 64 bytes ofstorage), a MURN can be much shorter—perhaps 16 bits as a typicalexample. The MURN has only to be unique within an individual tape. Then,in conjunction with a tape identification (perhaps written or printed onthe tape, recorded in some form in the TC user bits or stored on atelefile—see below) a piece of recorded material can be identifiedwithin an organisation in order to map that material onto a globallyunique UMID for later reference.

Therefore, the MURN simply needs to be a number which increments,decrements or otherwise varies from material to material on the tape. Aslong as the same MURN is not re-used for two pieces of material on thesame tape, the local uniqueness requirement is fulfilled.

Referring to FIG. 31, a camcorder 460 comprises a video and audio pickuparrangement 462 (e.g. a CCD image pickup device and a microphone)outputting data audio (A) and video (V) data streams, a MURN generator464, a multiplexer 466 and a tape recording arrangement 468.

The MURN generator can take many forms, and serves to generate MURNs asdescribed above. Each time the camera starts recording (i.e. at eachvideo in-point) a new MURN is generated by the MURN generator forrecording on the TC user bits relating to that piece of material.

The MURN generator operates in conjunction with a “telefile”™ memory 470associated with the tape medium. The telefile is a non-volatile memorydevice (e.g. a flash memory device) permanently attachable to the tapecase 472 (e.g. a cassette) and which can be remotely interrogated—atleast in a non-contact fashion—by a reader/writer 474 for example usingmagnetic induction for power and data transmission. The telefile stores,amongst other possible date items, a tape identifier and the highestvalue of MURN (on a 16 bit count) previously used.

So, when a new MURN needs to be generated, the MURN generatorinterrogates the telefile to find the highest previously used MURNvalue, increments it and uses that as the new MURN value, writing thenew MURN value back to the telefile.

The MURN generated by the MURN generator is passed, with the video andaudio data streams and (if used) good shot markers and the like, to themultiplexer 466 for recording on the tape.

The multiplexer arranges the MURN data and the GSM flags (and any othersuch data) into the time code user bits for both the LTC and the VITC.These user bits are then passed to the tape transport in a conventionalway for recording on the tape. Accordingly, embodiments of the inventionprovide for the material identifying code to be stored effectivelytwice—in the LTC and in the VITC. This provides for a reliable replay ofthe material identifying code across a range of replay speeds from jog(single frame movement, where the VITC is more successfully read) toshuttle (where the LTC is more successfully read).

Embodiments of the invention also extend to a tape replay devicearranged to recover the MURN data from either or both of the LTC andVITC. Such a device may be substantially as drawn in FIG. 31, but with ademultiplexer performing the above operation in place of the multiplexer466.

Third Aspect of the Invention

Recording Metadata on Tape—FIGS. 32 to 41

There are a variety of apparatus in which audio/video informationsignals are recorded onto a linear recording medium. To illustrateexample embodiments of the present invention, a video camera will beused as an example of such an apparatus. However as will be appreciatedthere are other examples of recording apparatus which could be used andin which information signals are recorded onto a linear recordingmedium.

An illustration of a video camera is shown in FIG. 32. In FIG. 32 thevideo camera is shown to comprise a lens arrangement 1 connected to acamera body 2. Also connected to the camera body 2 is a control unit 4and a view finder 6. Also forming part of the camera is a microphone 8which is coupled to a sound processor 10. The video camera operates togenerate video signals representative of images falling on an imagesensor 12. The lens arrangement 1 operates to focus images fallingwithin the field of view of the lens arrangement 1 onto the image sensor12. The image sensor 12 is connected to a video processor 14. The soundprocessor 10 and the video processor 14 feed audio and video signalsrespectively to a recorder 16. The recorder 16 operates to record theaudio/video signals onto a magnetic tape which is loaded into the videocamera, although this is not shown in FIG. 32. The video camera body 2is also shown to include a metadata generation processor 20 which isconnected to the recorder 16 by a connecting channel 22. The video andaudio signals are received by the recorder 16 via two further connectingchannels 24, 26. A better understanding of the operation of the recorder16 which is also arranged to reproduce recorded audio and video signalsmay be gathered from a more detailed block diagram of the body 2 of thevideo camera, which shows the recorder 16 and the metadata generationprocessor 20 where parts also appearing in FIG. 32 bear identicalnumerical designations.

In FIG. 33 the recorder 16 is shown to include a tape drive 30 having adrive head arrangement 32 which is configured to convert audio and videosignals as well as metadata into a form in which they can be recordedonto a magnetic tape 34. Signals representative of metadata are producedby a read/write control processor 36 which is coupled to the drive head32 via a connecting channel 38. To provide the read/write controlprocessor 36 with a temporal reference, a frame clock 40 is connected toa first input 42 of the control processor 36. The read/write controlprocessor 36 also receives from an external channel 44 signalsrepresentative of externally generated metadata. At a third input 48 theread/write control processor 36 receives signals representative ofinternally generated metadata from the metadata generation processor 20.The control processor 36 is also provided with a data store 50 which iscoupled to the control processor 36 via a bi-directional connectingchannel 52. The metadata generation processor 20 is shown in FIG. 33 toreceive an input from a clock 54 and three further input sensors 56, 58,60. The clock 54 provides the metadata generation processor with atemporal reference at which the video and audio signals generated by theaudio processor and the video processor 10, 14 respectively areproduced. The three sensors 56, 58, 60 serve to generate differentsignals representative of values of parameters of the camera which wereuse to capture the audio and video signals. For example the sensor 56could provide signals representative of the “F-stop” or aperture settingof the lens arrangement 1 whereas the second sensor 56 could provide anindication of a frame rate of the video camera. The third sensor 60could provide an indication of a “good shot marker” which is manuallyset by the operator of the camera when a good image or shot has beenrecorded by the camera.

The read/write control processor 36 is arranged in operation to controlthe tape drive 30 so that the audio and video signals are recorded ontothe magnetic tape 34. However according to the example embodiment of thepresent invention the read/write control processor 36 also operates torecord metadata associated with the audio/video signals onto themagnetic tape 34.

In order to better appreciate and understand the example embodiments ofthe present invention a brief description and explanation will be givenof the way in which audio/video signals are recorded onto the magnetictape 34 along with a time code and an amount of storage capacityallocated to user specified bits. The user specified bits are an exampleof an information field which is available to the user and which can beused to for example record metadata. To this end FIG. 34 provides anillustration of an arrangement of read/write heads 33 which form part ofthe drive heads 32 in more detail. In FIG. 34A the drive heads 32 areshown to include a rotating head 70 and a linear head 72. FIG. 34A showsthe tape read/write heads 33 as a plan view so that the linear recordinghead 72 is represented as being below the rotating head 70. As shown inFIG. 34A the magnetic recording tape 34 is wrapped in the shape of theletter omega (Ω) around the rotating head. In this way the magnetic tapeis fed past the rotating head 70 and the linear head 72 by the tapedrive 30 at a rate which is determined by the band width of the videoand audio signals being recorded and properties of the magnetic tape torepresent this band width. As already indicated, the rate at which thetape is driven during recording is the recording rate. FIG. 34B shows anelevation view of the read/write heads 33 and as confirmed in this viewthe linear recording head is disposed underneath the rotating head 70.

The operation of the read/write heads 33 will now be explained withreference to FIG. 35 in which the magnetic tape 34 is shown with areas76, 78 of the tape shown where information has been recorded. In FIG. 35the magnetic tape 34 is shown to have recorded on the surface aplurality of helical scan tracks 76 which are disposed at an angle tothe linear access at which the magnetic tape is moved past theread/write heads 32. The direction of movement and the linear axis ofthe magnetic tape are indicated by the arrow 74. In FIG. 35 the helicalscan tracks 76 are formed by the rotating head 70 which operates torecord the audio and video signals in the helical scan tracks which arearranged at an angle to the linear access of the magnetic tape 34 sothat the band width of the video and audio signals can be accommodatedwithin the band width which the properties of the magnetic tape canaccommodate. Also shown in FIG. 35 are further areas below the helicalscan tracks 76 which are representative of the areas on the magnetictape where the linear head 72 records the user specified bits (USB)which also include a time code (TC). These linear tracks 78 which arerecorded along the linear access of the magnetic tape 34. This isbecause the band width allocated to the USB and TC information isconsiderably less than that of the video and audio signals.

As will be appreciated by those skilled in the art, the read/write heads33 can function both to record the video and audio information signalsonto the magnetic tape 34 in the helical scan tracks 76 as well as theUSB and TC information in the linear tracks 78 and correspondingly toread or reproduce the information recorded into the magnetic tape 34.When reproducing information recorded onto a magnetic tape it is oftennecessary to scan or fast forward through the audio and videoinformation in order to navigate through the content of the audio andvideo information signals. As explained above, this is also known by theterm ‘shuttle’ mode reproduction. In this mode although the rotatinghead 70 is arranged to rotate at the same speed as that when audio andvideo information signals were recorded onto the magnetic tape 34 sothat the head follows the helical scan tracks at the angle at which thetracks were created with respect to the linear axis of the magnetictape, during shuttle mode the magnetic tape is driven by the tape driveat a greater speed. As a result the rotating head passes over more thanone helical scan track, the number of tracks that the rotating read headpasses during any one rotation being determined by the amount by whichthe rate of feeding the linear tape during the reading rate is greaterthan the recording rate when the information was recorded. This isillustrated in FIG. 36A.

In FIG. 36A the magnetic tape 34 shown in FIG. 35 is shown to includedesignated areas shown as bold boxes 80 in FIG. 36A which arerepresentative of the area of the magnetic tape from which informationcan be recovered by the rotating head 70 at a particular time. In FIG.36A the arrow 74 which is representative of the rate at which themagnetic tape is fed past the read/write heads 70, 72 is shown to betwice that of the recording rate shown in FIG. 35 which is representedby a reference “×2”. As shown in FIG. 36A, a result of the increase inspeed of the magnetic tape is that instead of the reproduction area 80following the linear tracks 76, the reproduction area 80 moves from thebottom of one track to the top of the subsequent track. This isrepresented by the arrow 82. Correspondingly, a further reproductionarea of the linear recording heads 72 is represented as a second boldbox 84. Since the rotating head 70 and the linear recording head 72 arestationary and have a fixed relationship with respect to one another,the second reproduction area 84 which is representative of the area onthe tape from which the linear read head 72 can recover informationmoves in correspondence with that of the first reproduction area 80.Therefore correspondingly with the times two (×2) speed up shown in FIG.36A, the amount of information which can be recovered from the lineartracks 78 is correspondingly reduced because conventionally theinformation is recovered at normal read speed. As such, although therewould be no loss of information from the linear recording track at up tothirty to forty times shuttle speed, conventionally a data processorexecuting software is provided to read the data recovered from thelinear recording track. The data processor would loose information at arate of twice shuttle speed (×2) because the data processor is arrangedto recover data read from the linear recording track at, for example,normal reading rate. As a result information from these areas will belost. Thus a reduced amount of the USB will be recovered from themagnetic tape 34.

FIGS. 36B and 36C reproduce the representation shown in FIG. 36A exceptthat FIG. 36B shows a times four×4 speed up and FIG. 36C shows a timeseight x8 speed up. As will be appreciated the amount of informationwhich can be recovered in the times four and times eight speed ups shownby FIGS. 36B and 36C will be proportionally reduced and this isindicated by the reproduction areas 80 shown at three positions as therotating head moves at the same rate as was used to record theinformation.

Returning to FIG. 33 the operation of example embodiments of the presentinvention will now be explained. As will be appreciated from theforegoing discussion during shuttle mode the amount of USB informationwhich may be recovered from the magnetic tape by the recorder 16 will bereduced in proportion to the amount by which the reading rate at whichthe magnetic tape is being shuttled past the read/write heads which isgreater than the recording rate at which the magnetic tape was drivenpast the read/write heads when the information signals and USB wererecorded onto the magnetic tape 34. The read/write control processor 36is arranged in operation to write the metadata fed from the externalconnecting channel 46 and the metadata generation processor 20 via theconnecting channel 48 onto the magnetic tape in the areas which aredesignated for the USB information. In general the write controlprocessor 36 operates to write repeatedly metadata onto the magnetictape 34 in the USB areas 78 the number times the metadata is repeatedbeing determined by the relative importance of the metadata.

As already explained there are various type of metadata which vary intheir value and importance. For example a UMID would be of considerablyhigher value than the aperture setting or ‘F-stop’ value of the camerawhen the video signals were generated. Therefore by recording the moreimportant metadata more often onto the magnetic tape than the lessimportant metadata, an inherent increase in a probability of correctlyrecovering and reproducing the more important metadata will be produced.

According to a first embodiment of the present invention the read/writecontrol processor 36 receives the metadata from the connecting channels46, 48 and identifies the metadata as being associated with apredetermined different number of metadata types. The type may be forexample the UMID, ‘F-stop’, time or location. This metadata is formedinto metadata objects which may consist of a defined number of bytes ora single byte of metadata of the predetermined type. As shown in FIG.37, according to the first embodiment of the present invention theread/write control processor 36 operates, after identifying the metadataobjects, to form the metadata objects into packets. In FIG. 37, boxes 90represent metadata objects which are different. The metadata objects arefed to a packet forming processor 92 which operates within theread/write control processor 36. The packet forming processor 92 thengenerates a metadata packet 94 by combining selected ones of themetadata objects and writing the metadata objects 90 into respectivefields 96, 98, 100, 102 of the metadata packet 94. The read/writecontrol processor 36 then generates header information which is writtento a header 104 of the metadata packet 94.

FIG. 38 provides a representation of an arrangement of metadata objectsof the read/write control processor 36 when operating in accordance withthe first embodiment of the present invention. The representation shownin FIG. 38 is an illustration of the arrangement in which metadata isrepeatedly recorded and does not represent the physical layout ofmetadata as recorded onto the magnetic tape 34. The representationprovided in FIG. 38 is therefore a conceptual form of one example of thefirst embodiment of the present invention which facilitatesunderstanding. In FIG. 38, a plurality of metadata packets 94 arerepresented as columns within a matrix in which the rows each representdifferent fields of the metadata packet. In a preferred embodiment, eachof the cells of the matrix represents a byte which is written onto theUSB areas of the magnetic tape by the read/write heads 32 under controlof the read/write control processor 36. As shown in FIG. 38 the metadataobjects which form the second, third and fourth rows of the first fourcolumns have the same values. These values represent, as anillustration, the letters of the word RED, each letter representing adifferent metadata object so that each letter is repeated four times.This is in accordance with the relative importance of this information.In the next four columns, the first three letters of the word GREEN arerepeat recorded in the fields of these four metadata packets. As will beseen however in FIG. 38, the header information has changed from thefirst four to the subsequent four metadata packets in order to indicateto a reproducing apparatus that the metadata packets have changed. Thisis because during a shuttle mode in which for example the reading rateis two times that of the recording rate×2, two out of four metadatapackets will be recovered. In this case, in order to provide areproducing apparatus with a means for distinguishing different metadatapackets from each other or to identify the same metadata packets whichhas been repeat recorded and recovered from the magnetic tape, theheader information is provided. Thus, as shown in FIG. 38, the headerinformation is arranged to be different between different metadatapackets which are successively recorded onto the magnetic tape. For themetadata objects which make up the words RED and GREEN, the headers arethe numbers ‘0’ and ‘1’ respectively. As shown in FIG. 38, the thirdgroup of four repeated metadata packets 108 has a further differentheader information and in this case is given the number ‘2’. The fieldsof theses metadata packets include the last two letters of the wordGREEN. The subsequent fourth group of four metadata packets are showneach to have a different value in the information header which arerepresented as numbers ‘0’, ‘1’, ‘2’ and ‘3’.

The fourth group of metadata packets 110 represent metadata which is ofrelatively low importance compared to the words GREEN and RED recordedin the earlier metadata packets. As such in the fourth group of metadatapackets, the metadata objects are recorded only once within the fieldsof the metadata packet. The information represented in the fourth groupof metadata packets 110 is the date 01.11.99 and the time 13:28. Againthe header information indicates that each of the metadata packets inthe fourth group 110 contains different metadata objects. As will beappreciated by comparing the recording arrangement represented by thematrix shown in FIG. 38, a reproducing apparatus will be able to recoverthe most important metadata represented by the words RED and GREENprovided the rate of reading the magnetic tape during a shuttle mode isnot greater than times four.

In order to recover the metadata which has been repeat recorded inaccordance with the first embodiment of the present invention, in apreferred embodiment the recorder 16 is arranged in reverse so that theread/write heads operate to read information to detect the metadatapackets and to distinguish and determine whether the same metadatapackets have been recovered or whether these are different metadatapackets. In this case the read/write control processor 36 would act as aread control processor. As will be appreciated the read/write controlprocessor may not be aware of the rate at which the tape drive isdriving the magnetic tape across the read/write heads 32. Therefore thecontrol processor 36 operates to detect the header information of themetadata packets which are recovered successively in accordance with theorder in which they were recorded. Therefore if in a group theread/write control processor 36 recovers a metadata packet and the nextrecovered metadata packet has the same header information, then theread/write control processor will discard the metadata packet having thesame header information. If however the next metadata packet containsdifferent header information then the read control processor will detectthat the subsequently detected metadata packet is a different packet andthat the metadata is different and therefore output the previousmetadata packet along with the audio/video information signals recoveredfrom the magnetic tape.

A further illustration of the method of recovering metadata, which hasbeen recorded onto the magnetic tape in accordance with the firstembodiment of the present invention, is illustrated by a flow diagramshown in FIG. 40. In FIG. 40 the first process step 120 is to recover ametadata packet, with the next process step 122 being to recover thenext metadata packet. At process step 124, the process determineswhether the header information has changed between the last metadatapacket and the previous metadata packet. If the header has not beenchanged then process step 126 is executed and the packet is discardedand the process continues from 122. If the header information haschanged then the process step 128 is executed and the previous metadatapacket is output. The process then continues from process step 122.

A second embodiment of the present invention will now be described inwhich the read/write control processor 36 operates to arrange themetadata and record the metadata repeatedly on the magnetic tape 34 inan arrangement which is represented by a table shown in FIG. 39. Therepresentation shown in FIG. 39 is an illustration of an arrangement inwhich metadata is repeatedly recorded and does not represent thephysical layout of metadata as recorded onto the magnetic tape 34. Therepresentation provided in FIG. 39 is therefore a conceptualrepresentation which facilitates understanding of one example of thesecond embodiment of the present invention. In accordance with thesecond embodiment of the present invention the read/write controlprocessor 36 arranges the metadata objects into a plurality ofcategories. These metadata objects are then repeatedly recorded a numberof times in dependence upon the relative importance of each of thecategories. In the example embodiment shown in FIG. 39 there are fourcategories. In the first category represented by the first row of thetable, the metadata objects are repeated eight times. Each of the cellsof the table again represents an area of the magnetic tape where the USB78 are recorded. As an example each cell may represent a byte ofinformation. Therefore as shown in FIG. 39 if the word RED is consideredto be of the highest importance then each of the letters whichrepresents this example metadata object is repeatedly recorded eighttimes so that the first eight cells of the first row contain the letter“R”. The next row of the table represents the next level of relativeimportance of the metadata. In this case the metadata objects arerepeated four times. Therefore for example if the word GREEN is anexample of metadata of this next level of relative importance, then eachof the metadata objects which are the letters of the word GREEN arerepeated four times therefore the first four cells of the row containthe letter “G” whereas the next four cells of the row contain the letter“R”. Similarly the next row of the matrix contains metadata of acorrespondingly reduced level of importance. In this example the nextlevel of metadata is represented by the word BLUE and in this row themetadata objects are repeated only twice. Therefore as shown in FIG. 39the first two cells of this row contain the word “B” the next two cellscontain the letter “L” and the next two cells contain the letter “U” andthe final two cells contain the letter “E”. Finally the final row of thetable in FIG. 39 contains the least important metadata which is againrepresented by the date 01.11.99 13:28. These metadata objects arerepeated only once. In operation the read/write control processor 36begins writing new objects of metadata with reference to the time codederived from the frame clock 40 which is coupled to the write controlprocessor 36.

A reproducing apparatus which operates to recover the metadata, whichhas been recorded onto the magnetic tape according to the secondembodiment of the second invention, operates as follows. The reproducingapparatus may be the recorder 16 but operated in a reverse mode in thatthe tape drive is now reading information from the magnetic tape so thatthe read/write heads are now reversed so that they recover informationrecorded onto the magnetic tape. The read/write control processor 36then operates as a read control processor. The control processorrecovers each of the metadata objects and from an order or a position atwhich the metadata objects were recorded onto the magnetic tape 34 inthe USB area 78, the read control processor is provided with anassociation of the recovered metadata objects with the categories inwhich they were recorded. This is provided with reference to the timecode, therefore time code provides an indication of the start of a newmetadata group of objects recorded onto the magnetic tape 34. In thisexample embodiment, the reproducing apparatus will operate to detect arelative rate at which the information signals are being read from themagnetic tape 34 in accordance with a rate at which the magnetic tape 34is being driven past the read/write heads 32 compared to the readingrate which is the speed at which the tape was driven when theinformation was being written onto and recorded onto the magnetic tape34. Therefore by comparing the reading rate to the recording rate, thecontrol processor 36 operates to calculate the number of metadataobjects which will be passed before the next metadata object can berecovered from the magnetic tape 34. By dividing this number by thenumber of times the metadata objects are repeated for each of thecategories, the read control processor is able to determine whether thesame object has been recovered from the magnetic tape and can thereforediscard redundant metadata objects which have been recovered repeatedly.Consider the example shown in FIG. 39. In this case if the read speed istwice that of the recording rate, then of the first row which isrepeated eight times, four of the objects will be recovered so that theread processor will have four “R”s. If however the reading rate is fourtimes that of the recording rate, then the control processor will onlyhave two “R”s. If the read rate is eight times the recording rate thenonly one “R” will be recovered. Similarly for the second row, if thereading rate is twice the recording rate then two “G”s will be recoveredfrom the first four cells of this category. If however the reading rateis four times the recording rate then only one “G” will be recovered. Ifhowever the reading rate is times eight then the “G” will not berecovered. Correspondingly it will be appreciated that the number oftimes the metadata objects are repeated determines the likelihood ofwhether these objects may be recovered during playback at differentshuttle reading rates.

A further understanding of the process of recovering metadata which hasbeen recorded in accordance with the second embodiment of the presentinvention is provided by a flow diagram which is shown in FIG. 41. InFIG. 41 a first step in the process 130 is to recover the metadataobjects from the USB areas 78. Necessarily, contemporaneously the timecode is also recovered with the USB information, which is represented asa process step 131. At the next process step 132 the categories to whichthe metadata objects were assigned when they were recorded aredetermined. At the next process step 134 a rate at which the readingrate of the metadata is determined with respect to a reading rate whichis the rate which the tape is being driven during reading mode. Atprocess step 134 the relative rate of reading is calculated by dividingthe reading rate by the recording rate. The next step 136 in theprocess, the pre-determined number of times the metadata objects wererepeatedly recorded for each category is divided by the relative readingrate calculated in step 134. A result of step 136 is to generate aredundancy number which is fed to process step 138. At step 138 allmetadata objects after one received following the first time code arediscarded to a number equal to the redundancy number.

As will be appreciated the first embodiment of the present invention hasan advantage in that the reproduction process for recovering themetadata is more efficient and more simple than the reproduction methodrequired for the metadata recorded in accordance with the secondembodiment of the present invention. However this advantage is gained ata disadvantage of requiring a redundant object to be added to themetadata packet which is the header information. In comparison thesecond embodiment of the present invention does not include anyredundant objects although repeat recording in accordance with preferredembodiment is arranged with reference to the time code provided by theframe clock 40.

As will be appreciated by those skilled in the art various modificationsmay be made to the embodiments herein before described without departingfrom the scope of the present invention. Furthermore it will beappreciated that the methods as herein before described may be embodiedand represented as instructions of a computer program. Furthermore thesystem and apparatus described herein before may form a suitablyprogrammed data processor operating in accordance with instructionsformed from a computer program.

Whilst the embodiments described above each include explicitly recitedcombinations of features according to different aspects of the presentinvention, other embodiments are envisaged according to the generalteaching of the invention, which include combinations of features asappropriate, other than those explicitly recited in the embodimentsdescribed above. Accordingly, it will be appreciated that differentcombinations of features of the appended independent and dependentclaims form further aspects of the invention other than those, which areexplicitly recited in the claims.

1. A method of recording video and/or audio material on a recordingmedium, comprising: generating an identifier for identifying a piece ofvideo and/or audio material recorded on the recording medium; generatingan activity indicator representative of relative activity within acontent of the video and/or audio material; and generating metadata inaccordance with the activity indicator.
 2. A method according to claim1, further comprising: indicating with the activity indicator a higherthan average degree of motion within the content of the video and/oraudio material when the material is video.
 3. A method according toclaim 1, further comprising: indicating with the activity indicator anevent of activity within the content of the video material when thematerial is video.
 4. A method according to claim 3, wherein indicatingincludes indicating with the activity indicator movement into or out ofa field of view of the content of the video material within the contentof the video material.
 5. A method according to claim 1, whereingenerating metadata in accordance with the activity indicator includesgenerating a pictorial representation of the content of the material inaccordance with the activity indicator when the material is video.
 6. Amethod according to claim 1, wherein generating metadata in accordancewith the activity indicator includes generating at least one of a startof a period of dialogue or a change in a scene of the content of thematerial in accordance with the activity indicator.
 7. A methodaccording to claim 6, wherein generating metadata in accordance with theactivity indicator includes generating at least one of an introductionof a new face or new face position within a scene of content of videomaterial in accordance with the activity indicator.
 8. A methodaccording to claim 1, wherein generating metadata in accordance with theactivity indicator includes generating text data representative ofspeech within the content of the material in accordance with theactivity indicator.
 9. A method according to claim 8, wherein generatingtext data representative of speech within the content of the material inaccordance with the activity indicator includes generating text datarepresentative of any one of first words of a sentence, a first activityof a speaker, or speech at an end of a sentence.
 10. A method accordingto claim 8, wherein generating text data representative of speech withinthe content of the material in accordance with the activity indicatorincludes generating text data representative of a point of interest in aspeech.
 11. A method according to claim 1, wherein generating metadatain accordance with the activity indicator includes generating picturestamps in accordance with the activity indicator.
 12. A method accordingto claim 1, wherein generating metadata in accordance with the activityindicator includes generating a pictorial representation of the contentof the video and/or audio material in accordance with the activityindicator.
 13. A method according to claim 1, wherein generatingmetadata in accordance with the activity indicator includes generatingsemantic metadata in accordance with the activity indicator.
 14. Anapparatus for recording video and/or audio material on a recordingmedium, comprising: a first generator configured to generate anidentifier for identifying a piece of video and/or audio materialrecorded on the recording medium; a second generator configured togenerate an activity indicator representative of relative activitywithin a content of the video and/or audio material; and a thirdgenerator configured to generate metadata in accordance with theactivity indicator.
 15. An apparatus according to claim 14, wherein thesecond generator generates the activity indicator such that a higherthan average degree of motion within the content of the video and/oraudio material is indicated when the material is video.
 16. An apparatusaccording to claim 14, wherein the second generator generates theactivity indicator such that an event of activity within the content ofthe video material is indicated when the material is video.
 17. Anapparatus according to claim 16, wherein the second generator generatesthe activity indicator such that movement into or out of a field of viewof the content of the video material is indicated within the content ofthe video material.
 18. An apparatus according to claim 14, wherein thethird generator generates metadata including a pictorial representationof the content of the material when the material is video.
 19. Anapparatus according to claim 14, wherein the third generator generatesmetadata including at least one of a start of a period of dialogue or achange in a scene of the content of the material.
 20. An apparatusaccording to claim 14, wherein the third generator generates metadataincluding at least one of an introduction of a new face or new faceposition within a scene of content of video material.
 21. An apparatusaccording to claim 14, wherein the third generator generates metadataincluding text data representative of speech within the content of thematerial.
 22. An apparatus according to claim 21, wherein the thirdgenerator generates metadata including text data representative of anyone of first words of a sentence, a first activity of a speaker, orspeech at an end of a sentence.
 23. An apparatus according to claim 21,wherein the third generator generates metadata including text datarepresentative of a point of interest in a speech.
 24. An apparatusaccording to claim 14, wherein the third generator generates metadataincluding picture stamps.
 25. An apparatus according to claim 14,wherein the third generator generates metadata including a pictorialrepresentation of the content of the video and/or audio material.
 26. Anapparatus according to claim 14, wherein the third generator generatesmetadata including semantic metadata.